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^ (54) Title: SECRETED PROTEINS AND USES THEREOF 

(57) Abstract: The invention provides isolated nucleic acid molecules, designated TANGO 253, which encode proteins contain- 
QO ing Clq domains and which are homologous to a human adipocyte complement-mediated protein precursor, TANGO 257, which 
O encode proteins homologous to the human extracellular molecule olfactomedin, a molecule important in the maintenance, growth 
§^ and differentiation of chemosensory cilia of olfactory neurons, INTERCEPT 258, which encode Ig domain-containing proteins that 
^ exhibit homology to an antigen (A33) expressed in colonic and small bowel epithelium, and TANGO 281, which encode proteins 
* — . downregulated in megakaryocytes that fail to express the gata-1 transcription factor (a factor critical for blood cell formation) and 
2 can, therefore, represent direct or indirect gata-1 targets. The invention also provides anti sense nucleic acid molecules, expression 

vectors containing the nucleic acid molecules of the invention, host cells into which the expression vectors have been introduced, and 
^5 non-human transgenic animals in which a nucleic acid molecule of the invention has been introduced or disrupted. The invention still 

further provides isolated polypeptides, fusin polypeptides, antigenic peptides and antibodies. Diagnostic, screening and therapeutic 
^ methods utilizing compositions of the invention are also provided. 
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SECRETED PROTEINS AND USES 
THEREOF 

This application is a continuation-in-part of U.S. patent application Serial No. 
5 09/336,536, filed June 1 8, 2000, the contents of which are incorporated herein by 
reference in its entirety. 

Background of the Invention 

Many secreted proteins, for example, cytokines and cytokine receptors, play a vital 

10 role in the regulation of cell growth, cell differentiation, and a variety of specific cellular 
responses. A number of medically useful proteins, including erythropoietin, granulocyte- 
macrophage colony stimulating factor, human growth hormone, and various interleukins, 
are secreted proteins. Thus, an important goal in the design and development of new 
therapies is the identification and characterization of secreted and transmembrane proteins 

1 5 and the genes which encode them. 

Many secreted proteins are receptors which bind a ligand and transduce an 
intracellular signal, leading to a variety of cellular responses. The identification and 
characterization of such a receptor enables one to identify both the ligands which bind to 
the receptor and the intracellular molecules and signal transduction pathways associated 

20 with the receptor, permitting one to identify or design modulators of receptor activity, e.g., 
receptor agonists or antagonists and modulators of signal transduction. 

Summary of the Invention 

The present invention is based, at least in part, on the discovery of cDNA 
25 molecules which encode the TANGO 253, 257 and 281 proteins and the INTERCEPT 258 
protein, all of which are either wholly secreted or transmembrane proteins. 

The TANGO 253 proteins are Clq domain-containing polypeptides that exhibit 
homology to a human adipocyte complement-related protein precursor. 

The TANGO 257 proteins are homologous to the human extracellular molecule 
30 olfactomedin, a molecule important in the maintenance, growth and differentiation of 
chemosensory cilia of olfactory neurons. 

The INTERCEPT 258 proteins are Ig domain-containing polypeptides that exhibit 
homology to an antigen (A33) expressed in colonic and small bowel epithelium, a protein 
that may represent a cancer cell marker. 

35 
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The TANGO 281 proteins represent proteins downregulated in megakaryocytes 
that fail to express the gata-1 transcription factor (a factor critical for blood cell formation) 
and can, therefore, represent direct or indirect gata-1 targets. 

The TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 proteins, 

5 fragments, derivatives, and variants thereof are collectively referred to herein as 

"polypeptides of the invention" or "proteins of the invention." Nucleic acid molecules 
encoding the polypeptides or proteins of the invention are collectively referred to as 
"nucleic acids of the invention." 

The nucleic acids and polypeptides of the present invention are useful as 

1 0 modulating agents in regulating a variety of cellular processes. Accordingly, in one 

aspect, this invention provides isolated nucleic acid molecules encoding a polypeptide of 
the invention or a biologically active portion thereof. The present invention also provides 
nucleic acid molecules which are suitable for use as primers or hybridization probes for 
the detection of nucleic acids encoding a polypeptide of the invention. 

15 The invention features nucleic acid molecules which are at least 30%, 35%, 40%, 

45%, 50%, 55%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of 
SEQ ID NO:l, SEQ ID NO:2, or the nucleotide sequence of the cDNA insert of an 
EpT253 clone deposited with ATCC® as Accession Number 207222, or a complement 
thereof. 

20 The invention features nucleic acid molecules which are at least 30%, 35%, 40%, 

45%, 50%, 55%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of 
SEQ ID NO:8, SEQ ID NO:9, or the nucleotide sequence of the cDNA insert of an 
EpTm253 clone deposited with ATCC® as Accession Number 207215, or a complement 
thereof. 

25 The invention features nucleic acid molecules which are at least 95% or 98% 

identical to the nucleotide sequence of SEQ ID NO:15, SEQ ID NO:16, or the nucleotide 

sequence of the cDNA insert of an EpT257 clone deposited with ATCC® as Accession 

Number 207222, or a complement thereof. 

The invention features nucleic acid molecules which are at least 95% or 98% 
30 identical to the nucleotide sequence of SEQ ID NO:21, SEQ ID NO:22, or the nucleotide 

sequence of the cDNA insert of an EpTm257 clone deposited with ATCC® as Accession 

Number 207217, or a complement thereof. 

The invention features nucleic acid molecules which are at least 45%, 50%, 55%, 

65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of SEQ ID NO:26, 
35 SEQ ID NO:27, or the nucleotide sequence of the cDNA insert of an EpT258 clone 

deposited with ATCC® as Accession Number 207222, or a complement thereof. 
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The invention features nucleic acid molecules which are at least 45%, 50%, 55%, 
65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of SEQ ID NO:37, 
SEQ ID NO:38, or the nucleotide sequence of the cDNA insert of an EpTm258 clone 
deposited with ATCC® as Accession Number 207221, or a complement thereof. 
5 The invention features nucleic acid molecules which are at least 30%, 35%, 40%, 

45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence 
of SEQ ID NO:46, SEQ ID NO:47, or the nucleotide sequence of the cDNA insert of an 
EpT281 clone deposited with ATCC® as Accession Number 207222, or a complement 
thereof. 

10 The invention features nucleic acid molecules which are at least 35%, 40%, 45%, 

50%, 55%, 65%, 75%, 85%, 95%, or 98% identical to the nucleotide sequence of SEQ ID 
NO:56, SEQ ID NO:57, or the nucleotide sequence of the cDNA insert of an EpmT281 
clone deposited with ATCC® as patent deposit Number PTA-224, or a complement 
thereof. 

15 The invention features nucleic acid molecules which are at least 30%, 35%, 40%, 

45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 98% identical to the 
nucleotide sequence of SEQ ID NO: 1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 
57, 77, 80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 

20 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 
183, 184, 185, 186, 187, 188, 189, 190, 191or 192, a complement thereof, or the non- 
coding strand of EpT 253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or 
EpTm281 cDNA of ATCC® Accession 207222, Accession Number 207215, Accession 
207217, Accession Number 207221, or patent deposit Number PTA-224, wherein said 

25 nucleic acid molecules encode polypeptides or proteins that exhibit at least one structural 
and/or functional feature of a polypeptide of the invention. 

The invention features nucleic acid molecules of at least 450, 500, 550, 600, 650, 
700, 750, 800, 850, 900, 1000, 1 100, 1200 or 1300 contiguous nucleotides of the 
nucleotide sequence of SEQ ID NO: 1, the nucleotide sequence of an EpT253 cDNA of 

30 ATCC® Accession Number 207222, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700 or 720 contiguous 
nucleotides of the nucleotide sequence of SEQ ID NO:2, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 

35 540, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1 100, 1200 or 1250 contiguous 
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nucleotides of the nucleotide sequence of SEQ ED NO:8 the nucleotide sequence of an 
EpTm253 cDNa of ATCC® Accession Number 207215, or a complement thereof. 

The invention features nucleic acid molecules of at least 310, 350, 400, 450, 500, 
550, 600, 650 or 700 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:9, 
5 or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
1800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO: 15 or its 
complement. 

The invention features nucleic acid molecules which include a fragment of at least 
10 1 1 50 or 1 200 contiguous nucleotides of the nucleotide sequence of SEQ ID NO: 1 6, or its 
complement. 

The invention features nucleic acid molecules which include a fragment of at least 
1100, 1200, 1300, 1400, 1500, 1600 or 1700 contiguous nucleotides of the nucleotide 
sequence of SEQ ID NO:21 the nucleotide sequence of an EpTm257 cDNA of ATCC® 
1 5 Accession Number 2072 1 7, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
1 150 or 1200 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:22, or its 
complement. 

The invention features nucleic acid molecules which include a fragment of at least 
20 420, 450, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700, or 
1 800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:26 the nucleotide 
sequence of an EpT258 cDNA of ATCC® Accession Number 207222, or a complement 
thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
25 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 

1600, 1700 or 1800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:27, 
or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
675, 700,800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous 
30 nucleotides of the nucleotide sequence of SEQ ID NO:37 the nucleotide sequence of an 
EpTm258 cDNA of ATCC® Accession Number 207221, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 
contiguous nucleotides of the nucleotide sequence of SEQ ID NO:38, or a complement 
35 thereof. 
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The invention features nucleic acid molecules which include a fragment of at least 
50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 
1600, 1700 or 1800 contiguous nucleotides of the nucleotide sequence of SEQ ID NO:46 
the nucleotide sequence of an EpT281 cDNA of ATCC® Accession Number 207222, or a 
5 complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
50, 100, 200, 300, 400, 500, 600, 700 or 750 contiguous nucleotides of the nucleotide 
sequence of SEQ ID NO:47, or a complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 
10 550, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800 or 1850 
contiguous nucleotides of the nucleotide sequence of SEQ ED NO:56 the nucleotide 
sequence of an EpTm281 cDNA of ATCC® patent deposit Number PTA-224, or a 
complement thereof. 

The invention features nucleic acid molecules which include a fragment of at least 

15 50, 100, 200, 300, 400, 500, 600 or 700 contiguous nucleotides of the nucleotide sequence 
of SEQ ID NO:57, or a complement thereof. 

The invention features isolated nucleic acid molecules having a nucleotide 
sequence that is at least about 20, 50, 100, 150, 200, 250, 300, 400, 450, 500, 550, 600, 
650, 700 or more contiguous nucleotides identical to the nucleic acid sequence of SEQ ID 

20 NOS: 1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 
105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 
171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 
189, 190, 191 or 192, or a complement thereof, or the non-coding strand of EpT253, 

25 EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or EpTm281 cDNA of 
ATCC® Accession 207222, Accession number 207215, Accession Number 207217, 
Accession Number 207221, or patent deposit number PTA-224, wherein said nucleic acid 
molecules encode polypeptides or proteins that exhibit at least one structural and/or 
functional feature of a polypeptide of the invention. 

30 The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a protein having an amino acid sequence that is at least 40%, 45%, 
50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of 
SEQ ID NO:3, the amino acid sequence encoded by an EpT253 cDNA of ATCC® 
Accession Number 207222, or a complement thereof. 

35 The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a protein having an amino acid sequence that is at least 95%, or 98% 
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identical to the amino acid sequence of SEQ ID NO: 10, the amino acid sequence encoded 

by an EpTm253 cDNA of ATCC® Accession Number 2071 15, or a complement thereof. 
The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a protein having an amino acid sequence that is at least 88%, 90%, 
5 95% or 98% identical to the amino acid sequence of SEQ ID NO: 17, the amino acid 

sequence encoded by an EpT257 cDNA of ATCC® Accession Number 207222, or a 

complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a protein having an amino acid sequence that is at least 88%, 90%, 
10 95%, or 98% identical to the amino acid sequence of SEQ ID NO:23, the amino acid 

sequence encoded by an EpTm257 cDNA of ATCC® Accession Number 2071 17, or a 

complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a protein having an amino acid sequence that is at least 45%, 50%, 
15 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID 

NO:28, the amino acid sequence encoded by an EpT258 cDNA of ATCC® Accession 

Number 207222, or a complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a protein having an amino acid sequence that is at least 45%, 50%, 
20 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID 

NO:39, the amino acid sequence encoded by an EpTm258 cDNA of ATCC® Accession 

Number 207221, or a complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a protein having an amino acid sequence that is at least 30%, 35%, 
25 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid 

sequence of SEQ ID NO:48, the amino acid sequence encoded by an EpT281 cDNA of 

ATCC® Accession Number 207222, or a complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a protein having an amino acid sequence that is at least 30%, 35%, 
30 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid 

sequence of SEQ ID NO:58, the amino acid sequence encoded by an EpTm281 of 

ATCC® patent deposit Number PTA-224, or a complement thereof. 

The invention also features nucleic acid molecules which include a nucleotide 

sequence encoding a polypeptide or protein having an amino acid sequence that is at least 
35 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95%, or 98% identical to the 

amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, or 58, the amino acid 



WO 00/78808 



PCT/US00/16883 



sequence encoded by EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, 
EpT281, or EpTm281 of ATCC® Accession Number 207222, Accession Number 207215, 
Accession Number 2072 1 7, or Accession Number 20722 1 , patent deposit Number PTA- 
224, or a complement thereof, wherein the polypeptide or protein encoded by the 

5 nucleotide sequence also exhibits at least one structural and/or functional feature of a 
polypeptide of the invention. 

In preferred embodiments, the nucleic acid molecules have the nucleotide sequence 
of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56 or 57, or the nucleotide 
sequence of the cDNA clones of ATCC® Accession Number 207222, 207215, 207217, 

10 207221, 207222, or PTA-224. 

Also within the invention are nucleic acid molecules which encode a fragment of a 
polypeptide having the amino acid sequence of SEQ ID NO:3, or a fragment including at 
least 10, 15, 20, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 230 or 240 contiguous amino 
acids of SEQ ID NO:3, or the amino acid sequence encoded by an EpT253 cDNA of 

15 ATCC® Accession Number 207222. 

Also within the invention are nucleic acid molecules which encode a fragment of a 
polypeptide having the amino acid sequence of SEQ ID NO: 17, or a fragment including at 
least 10, 15, 20, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 230 or 240 contiguous amino 
acids of SEQ ID NO: 10, or the amino acid sequence encoded by an EpTm253 cDNA of 

20 ATCC® Accession Number 2072 15. 

Also within the invention are nucleic acid molecules which encode a fragment of a 
polypeptide having the amino acid sequence of SEQ ID NO: 10, or a fragment including at 
least 360, 370, 380, 390 or 400 contiguous amino acids of SEQ ID NO: 17, or the amino 
acid sequence encoded by an EpT257 cDNA of ATCC® Accession Number 207222. 

25 Also within the invention are nucleic acid molecules which encode a fragment of a 

polypeptide having the amino acid sequence of SEQ ID NO:23, or a fragment including at 
least 360, 370, 380, 390 or 400 contiguous amino acids of SEQ ID NO:23, or the amino 
acid sequence encoded by an EpTm257 cDNA of ATCC® Accession Number 207217. 

Also within the invention are nucleic acid molecules which encode a fragment of a 

30 polypeptide having the amino acid sequence of SEQ ID NO:3, or a fragment including at 
least 15, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 350 or 360 
contiguous amino acids of SEQ ID NO:28, or the amino acid sequence encoded by an 
EpT258 cDNA of ATCC® Accession Number 207222. 

Also within the invention are nucleic acid molecules which encode a fragment of a 

35 polypeptide having the amino acid sequence of SEQ ID NO:39, or a fragment including at 
least 160, 175, 200, 225, 250, 275, 300, 350, 375 or 385 contiguous amino acids of SEQ 
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ID NO:39, or the amino acid sequence encoded by an EpT258 cDNA of ATCC® 

Accession Number 207221 . 

Also within the invention are nucleic acid molecules which encode a fragment of a 

polypeptide having the amino acid sequence of SEQ ID NO:48, or a fragment including at 
5 least 15, 25, 30, 50, 75, 100, 125, 150, 175, 200, 225, 235 or 240 contiguous amino acids 

of SEQ ID NO:48, or the amino acid sequence encoded by an EpT281 cDNA of ATCC® 

Accession Number 207222. 

Also within the invention are nucleic acid molecules which encode a fragment of a 

polypeptide having the amino acid sequence of SEQ ID NO:58, or a fragment including at 
10 least 15, 25, 30, 50, 75, 100, 125, 150, 175 or 200 contiguous amino acids of SEQ ID 

NO:58, or the amino acid sequence encoded by an EpTm281 cDNA of ATCC® patent 

deposit Number PTA-224. 

The invention also features nucleic acid molecules which encode a polypeptide 

fragment of at least 15, 25, 30, 50, 75, 100, 125, 150, 175, 200 or more contiguous amino 
1 5 acids of SEQ ID NO:3, 10, 17, 23, 28, 39, 48 or 58, or the amino acid sequence encoded 

by EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or EpTm281 of 

ATCC® Accession Number 207222, Accession Number 207215, Accession Number 

207217, Accession Number 207221 or patent deposit Number PTA-224, wherein the 

fragment also exhibits at least one structural and/or functional feature of a polypeptide of 
20 the invention. 

The invention includes nucleic acid molecules which encode a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID NO:3, 10, 
17, 23,28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 

25 or 164, or the amino acid sequence encoded by a cDNA of ATCC® Accession Number 
207222, Accession Number 207215, Accession Number 207217, Accession Number 
207221 or patent deposit Number PTA-224, wherein the nucleic acid molecule hybridizes 
to a nucleic acid molecule consisting of a nucleic acid sequence encoding SEQ ID NO:3, 
10, 28, 39, 48,58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 

30 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 
164, or the amino acid sequence encoded by a cDNA of ATCC® Accession Number 
207222, Accession Number 207215, Accession Number 207217, Accession Number 
207221 or patent deposit Number PTA-224, or a complement thereof under stringent 
conditions. 

35 Also within the invention are isolated polypeptides or proteins having an amino 

acid sequence that is at least about 40%, preferably 45%, 55%, 65%, 75%, 85%, 95% or 
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98% identical to the amino acid sequence of SEQ ID NO:3, or the amino acid sequence 
encoded by an EpT253 cDNA of ATCC® Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins having an amino 
acid sequence that is at least about 40%, preferably 45%, 50%, 55%, 65%, 75%, 85%, 

5 95% or 98% identical to the amino acid sequence of SEQ ID NO: 10, or the amino acid 
sequence encoded by an EpTm253 cDNA of ATCC® Accession Number 207215. 

Also within the invention are isolated polypeptides or proteins having an amino 
acid sequence that is at least 88%, 90%, 95% or 98% identical to the amino acid sequence 
of SEQ ID NO: 17, or the amino acid sequence encoded by an EpT257 cDNA of ATCC® 

1 0 Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins having an amino 
acid sequence that is at least 88%; 90%, 95% or 98% identical to the amino acid sequence 
of SEQ ID NO:23, or the amino acid sequence encoded by an EpTm257 cDNA of 
ATCC® Accession Number 20721 7. 

15 Also within the invention are isolated polypeptides or proteins having an amino 

acid sequence that is at least about 30%, preferably 35%, 45%, 55%, 65%, 75%, 85%, 
95% or 98% identical to the amino acid sequence of SEQ ID NO:28, or the amino acid 
sequence encoded by an EpT258 cDNA of ATCC® Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins having an amino 

20 acid sequence that is at least about 30%, preferably 35%, 40%, 45%, 50%, 55%, 65%, 
75%, 85%, 95% or 98% identical to the amino acid sequence of SEQ ID NO:39, or the 
amino acid sequence encoded by an EpTm258 cDNA of ATCC® Accession Number 
207221. 

Also within the invention are isolated polypeptides or proteins having an amino 
25 acid sequence that is at least about 30%, preferably 35%, 45%, 55%, 65%, 75%, 85%, 
95% or 98% identical to the amino acid sequence of SEQ ID NO:48, or the amino acid 
sequence encoded by an EpT281 cDNA of ATCC® Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins having an amino 
acid sequence that is at least about 30%, preferably 35%, 40%, 45%, 50%, 55%, 65%, 
30 75%, 85%, 95% or 98% identical to the amino acid sequence of SEQ ID NO:58, or the 
amino acid sequence encoded by an EpTm281 cDNA of ATCC® patent deposit Number 
PTA-224. 

The invention also features isolated polypeptides or proteins having an amino acid 
sequence that is at least about 30%, preferably 35%, 40%, 45%, 50%, 55%, 65%, 75%, 
35 85%, 95% or 98% identical to the amino acid sequence of SEQ ID NO:3, 10, 1 7, 23, 28, 
39, 48 or 58, or the amino acid sequence encoded by EpT253, EpTm253, EpT257, 
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EpTm257, EpT258, EpTm258, EpT281 or EpTm281 of ATCC® Accession Number 
207222, Accession Number 207215, Accession Number 207217, Accession Number 
207221, patent deposit Number PTA-224, wherein the protein or polypeptides also 
exhibit at least one structural and/or functional feature of a polypeptide of the invention. 

5 Also within the invention are isolated polypeptides or proteins which are encoded 

by a nucleic acid molecule having a nucleotide sequence that is at least about 30%, 
preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to 
the nucleic acid sequence encoding SEQ ID NO:3, and isolated polypeptides or proteins 
which are encoded by a nucleic acid molecule having a nucleotide sequence which 

10 hybridizes under stringent hybridization conditions to a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NO: 1 or SEQ ED NO:2, a complement thereof, or the non- 
coding strand of an EpT253 cDNA of ATCC® Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins which are encoded 
by a nucleic acid molecule having a nucleotide sequence that is at least about 30%, 

1 5 preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to 
the nucleic acid sequence encoding SEQ ID NO: 10, and isolated polypeptides or proteins 
which are encoded by a nucleic acid molecule having a nucleotide sequence which 
hybridizes under stringent hybridization conditions to a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NO:8 or SEQ ID NO:9, a complement thereof, or the non- 
20 coding strand of an EpTm253 cDNA of ATCC® Accession Number 207215. 

Also within the invention are isolated polypeptides or proteins which are encoded 
by a nucleic acid molecule having a nucleotide sequence that is at least about 45%, 50%, 
55%, 60%, 65%, 75%, 85%, 95% or 98% identical to the nucleic acid sequence encoding 
SEQ ID NO:28, and isolated polypeptides or proteins which are encoded by a nucleic acid 

25 molecule having a nucleotide sequence which hybridizes under stringent hybridization 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:26 or 
SEQ ID NO:27, a complement thereof, or the non-coding strand of an EpT258 cDNA of 
ATCC® Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins which are encoded 

30 by a nucleic acid molecule having a nucleotide sequence that is at least about 45%, 50%, 
55%, 60%, 65%, 75%, 85%, 95% or 98% identical to the nucleic acid sequence encoding 
SEQ ID NO:39, and isolated polypeptides or proteins which are encoded by a nucleic acid 
molecule having a nucleotide sequence which hybridizes under stringent hybridization 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:37 or 

35 SEQ ID NO:38, a complement thereof, or the non-coding strand of an EpTm258 cDNA of 
ATCC® Accession Number 207221. 
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Also within the invention are isolated polypeptides or proteins which are encoded 
by a nucleic acid molecule having a nucleotide sequence that is at least about 30%, 
preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to 
the nucleic acid sequence encoding SEQ ID NO:48, and isolated polypeptides or proteins 

5 which are encoded by a nucleic acid molecule having a nucleotide sequence which 

hybridizes under stringent hybridization conditions to a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NO:46 or SEQ ID NO:47, a complement thereof, or the 
non-coding strand of an EpT281 cDNA of ATCC® Accession Number 207222. 

Also within the invention are isolated polypeptides or proteins which are encoded 

10 by a nucleic acid molecule having a nucleotide sequence that is at least about 30%, 

preferably 35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to 
the nucleic acid sequence encoding SEQ ID NO:58, and isolated polypeptides or proteins 
which are encoded by a nucleic acid molecule having a nucleotide sequence which 
hybridizes under stringent hybridization conditions to a nucleic acid molecule having the 

1 5 nucleotide sequence of SEQ ID NO:56 or SEQ ID NO:57, a complement thereof, or the 
non-coding strand of an EpTm281 cDNA of ATCC® patent deposit Number PTA-224. 

The invention also features isolated polypeptides or proteins which are encoded by 
a nucleic acid molecule having a nucleotide sequence that is at least about 30%, preferably 
35%, 40%, 45%, 50%, 55%, 60%, 65%, 75%, 85%, 95% or 98% identical to a nucleic 

20 acid sequence encoding SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162 or 164, isolated polypeptides or proteins which are 
encoded by a nucleic acid molecule having a nucleotide sequence which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule having the nucleotide 

25 sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 101, 
103, 104, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 
169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 
187, 188, 189, 190, 191 or 192, a complement thereof, or the non-coding strand of 

30 EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281, EpTm281 of 
ATCC® Accession Number 207222, Accession Number 207215, Accession Number 
2072 1 7, Accession Number 20722 1 , patent deposit Number PTA-224, wherein 
polypeptides or proteins also exhibit at least one structural and/or functional feature of a 
polypeptide of the invention. 

35 Also within the invention are polypeptides which are naturally occurring allelic 

variants of a polypeptide that includes the amino acid sequence of SEQ ID NO:3, 10, 17, 
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23, 28, 39, 48 or 58, or the amino acid sequence encoded by a cDNA of ATCC® 
Accession Number 207222, Accession Number 207215, Accession Number 207217 
Accession Number 207221, or patent deposit Number PTA-224, wherein the polypeptide 
is encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule having 

5 the sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56 or 57, or a 
complement thereof under stringent conditions. 

The invention also features nucleic acid molecules that hybridize under stringent 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:l or 
2, or an EpT253 cDNA of ATCC® Accession Number 207222, or a complement thereof. 

10 In other embodiments, the nucleic acid molecules are at least 450, 500, 550, 600, 650, 
700, 750, 800, 1000, 1 100, 1200 or 1300 contiguous nucleotides in length and hybridize 
under stringent conditions to a nucleic acid molecule comprising the nucleotide sequence 
of SEQ ID NO:l or 2, an EpT253 cDNA of ATCC® Accession Number 207222, or a 
complement thereof. 

1 5 The invention also features nucleic acid molecules that hybridize under stringent 

conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:8 or 
SEQ ID NO:9, an EpTm253 cDNA of ATCC® Accession Number 207215, or a 
complement thereof. In other embodiments, the nucleic acid molecules are at least 540, 
550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1 100, 1 159, 1200, or 1250 

20 contiguous nucleotides in length and hybridize under stringent conditions to a nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NO:8 or SEQ ID NO:9, an 
EpTm253 cDNA of ATCC® Accession Number 207215, or a complement thereof 

The invention also features nucleic acid molecules that hybridize under stringent 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO: 15 or 

25 SEQ ID NO:16, an EpT257 cDNA of ATCC® Accession Number 207222, or a 

complement thereof and encode a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 17, or encode a polypeptide comprising at least 360, 370, 380, 390 or 400 
contiguous amino acids or SEQ ID NO: 17. 

The invention also features nucleic acid molecules that hybridize under stringent 

30 conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:21 or 
SEQ ID NO:22, an EpTm257 cDNA of ATCC® Accession Number 207217, or a 
complement thereof, and encode a polypeptide comprising the amino acid sequence of 
SEQ ID NO:23, or a polypeptide comprising at least 360, 370, 380, 390, or 400 
contiguous amino acids of SEQ ID NO:23. 

35 The invention also features nucleic acid molecules that hybridize under stringent 

conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:26 or 
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SEQ ID NO:27, an EpT258 cDNA of ATCC® Accession Number 207222, or a 
complement thereof. In other embodiments, the nucleic acid molecules are at least 550, 
600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous 
nucleotides in length and hybridize under stringent conditions to a nucleic acid molecule 

5 comprising the nucleotide sequence of SEQ ID NO:26 or SEQ ID NO:27, an EpT258 
cDNA of ATCC® Accession Number 207222, or a complement thereof. 

The invention also features nucleic acid molecules that hybridize under stringent 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:37 or 
SEQ ID NO:38, an EpTm258 cDNA of ATCC® Accession Number 207221, or a 

10 complement thereof. In other embodiments, the nucleic acid molecules are at least 650, 
700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous 
nucleotides in length and hybridize under stringent conditions to a nucleic acid molecule 
comprising the nucleotide sequence of SEQ ID NO:37 or SEQ ID NO:38, an EpTm258 
cDNA of ATCC® Accession Number 207221, or a complement thereof. 

15 The invention also features nucleic acid molecules that hybridize under stringent 

conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:46 or 
47, an EpTm281 cDNA of ATCC® Accession Number 207222, or a complement thereof. 
In other embodiments, the nucleic acid molecules are at least 710, 750, 800, 900, 1000, 
1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 contiguous nucleotides in length and 

20 hybridize under stringent conditions to a nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO:46 or SEQ ID NO:47, an EpT281 cDNA of ATCC® Accession 
Number 207222, or a complement thereof. 

The invention also features nucleic acid molecules that hybridize under stringent 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:56 or 

25 57, an EpTm281 cDNA of ATCC® patent deposit Number PTA-224, or a complement 
thereof. In other embodiments, the nucleic acid molecules are at least 580, 600, 700, 800, 
. 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700, 1800 or 1850 contiguous 

nucleotides in length and hybridize under stringent conditions to a nucleic acid molecule 
comprising the nucleotide sequence of SEQ ID NO:56 or SEQ ID NO:57, an EpTm281 

30 cDNA of ATCC® patent deposit Number PTA-224, or a complement thereof. 

The invention also features nucleic acid molecules that hybridize under stringent 
conditions to a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:l, 2, 
8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 101, 103, 105, 107, 109, 111,1 13, 
115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 

35 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 
176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or a 
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nucleotide sequence of EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, 
EpT281 or EpTm281 of ATCC® Accession Number 207222, Accession Number 207215, 
Accession Number 2072 1 7, Accession Number 20722 1 , patent deposit Number PTA-224, 
or complement thereof, wherein such nucleic acid molecules encode polypeptides or 
5 proteins that exhibit at least one structural and/or functional feature of a polypeptide of the 
invention. 

The invention also features nucleic acid molecules at least 15, preferably at least 
50, at least 75, at least 100, at least 150, at least 200, at least 250, at least 300, at least 350, 
at least 400, at least 500, at least 600, at least 700, at least 800, at least 1000, at least 1 100 

10 or at least 1200 or more contiguous nucleotides in length which hybridize under stringent 
conditions to a nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO:l,2, 8, 9, 15, 16,21,22, 26, 27, 37,38, 46,47, 56, 57, 77, 80,91, 100, 101, 103, 104, 
105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 
141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 

15 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 
189, 190, 191 or 192, or a nucleotide sequence of EpT253, EpTm253, EpT257, EpTm257, 
EpT258, EpTm258, EpT281 or EpTm281 of ATCC® Accession Number 207222, 
Accession Number 207215, Accession Number 207217, Accession Number 207221, 
patent deposit Number PTA-224, or a complement thereof, wherein said nucleic acid 

20 molecules encode polypeptides or proteins that exhibit at least one structural and/or 
functional feature of a polypeptide of the invention. 

In one embodiment, the invention provides an isolated nucleic acid molecule 
which is antisense to the coding strand of a nucleic acid of the invention. 

Another aspect of the invention provides vectors, e.g., recombinant expression 

25 vectors, comprising a nucleic acid molecule of the invention. In another embodiment, the 
invention provides host cells containing such a vector or engineered to contain and/or 
express a nucleic acid molecule of the invention. The invention also provides methods for 
producing a polypeptide of the invention by culturing, in a suitable medium, a host cell of 
the invention such that a polypeptide of the invention is produced. 

30 Another aspect of this invention features isolated or recombinant proteins and 

polypeptides of the invention. Preferred proteins and polypeptides possess at least one 
biological activity possessed by the corresponding naturally-occurring human polypeptide. 
An activity, a biological activity, or a functional activity of a polypeptide or nucleic acid 
of the invention refers to an activity exerted by a protein, polypeptide or nucleic acid 

35 molecule of the invention on a responsive cell as determined in vivo or in vitro, according 
to standard techniques. Such activities can be a direct activity, such as an association with 
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or an enzymatic activity on a second protein, or an indirect activity, such as a cellular 
signaling activity mediated by interaction of the protein with a second protein. 

For TANGO 253, biological activities include, e.g., (1) the ability to modulate (this 
term, as used herein, includes, but is not limited to, "stabilize", promote, inhibit or disrupt, 

5 protein-protein interactions (e.g., homophilic and/or heterophilic), and protein-ligand 
interactions, e.g., in receptor-ligand recognition; (2) the ability to modulate the 
development, differentiation, maturation, proliferation and/or activity of cells of the 
central nervous system such as neurons, glial cells (e.g., astrocytes and oligodendrocytes), 
and Schwann cells; (3) the ability to modulate the development of central nervous system; 

10 (4) the ability to modulate the development, differentiation, maturation, proliferation 
and/or activity of renal cells; (5) the ability to modulate the development, differentiation, 
maturation, proliferation and/or activity of testical cells, such as germ cells, leydig cells 
and Sertoli cells; (6) the ability to modulate the development, differentiation, maturation, 
proliferation and/or activity of ovarian cells; (7) ability to modulate cell-cell interactions 

15 and/or cell-extracellular matrix interactions; (8) the ability to modulate the host immune 
response, e.g., by modulating one or more elements in the serum complement cascade; (9) 
the ability to modulate the proliferation, differentiation and/or activity of cells that form 
blood vessels and coronary tissue (e.g., coronary smooth muscle cells and/or blood vessel 
endothelial cells); (10) the ability to modulate intracellular signaling cascades (e.g., signal 

20 transduction cascades); and (1 1) the ability to modulate adipocyte function. 

For TANGO 257, biological activities include, e.g., (1) the ability to modulate the 
development, differentiation, proliferation and/or activity of neuronal cells, e.g., olfactory 
neurons (2) the ability to modulate the development, differentiation, proliferation and/or 
activity of pulmonary system cells, e,g, lung cell types; (4) the ability to modulate the 

25 development, differentiation, maturation, proliferation and/or activity of bone cells such as 
osteocytes, osteoblasts and osteoclasts (e.g., the ability promote the development of 
osteocytes); (5) the ability to modulate the development of bone structures such as the 
skull, the basisphenoid bone, the upper and lower incisor teeth, the vertebral column, the 
sternum, the scapula, and the femur during embryogenesis; (6) the ability to modulate the 

30 development, differentiation, maturation, proliferation and/or activity of renal cells; (7) the 
ability to modulate the development, differentiation, maturation, proliferation and/or 
activity of intestinal cells such as M cells; (8) the ability to modulate cell-cell interactions 
and/or cell-extracellular matrix interactions, e.g., neuronal cell-extracellular matrix 
interactions; (9) the ability to modulate cell proliferation, e.g., abnormal cell proliferation; 

35 and (10) the ability to modulate the development, differentiation, proliferation and/or 
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activity of cells that form blood vessels and coronary tissue, e.g., coronary smooth muscle 
cells and/or blood vessel endothelial cells. 

For INTERCEPT 258, biological activities include, e.g., (1) the ability to modulate 
protein-protein interactions (e.g., hemophilic and/or heterophilic), and protein-ligand 

5 interactions, e.g., in receptor-ligand recognition; (2) the ability to modulate cell-cell 
interactions; (3) the ability to modulate the host immune response; (4) the ability to 
modulate the development, differentiation, maturation, proliferation and/or activity of 
pulmonary system cells such as bronchial cells; (5) the ability to modulate the 
development, differentiation, maturation, proliferation and/or activity of renal cells; (5) the 

10 ability to modulate the development, differentiation, maturation, proliferation and/or 
activity of cardiac cells such cardiac myocytes; (6) the ability to modulate the 
development of brown fat (e.g., the promotion of the development of brown fat); (7) the 
ability to modulate the development, differentiation, maturation, proliferation and/or 
activity of endothelial cells; (8) the ability to modulate cell proliferation, e.g., 

15 gastrointestinal tract epithelial cell proliferation; (9) the ability to modulate intracellular 
signaling cascades (e.g., signal transduction cascades); and (10) the ability to modulate 
thrombosis (e.g., the ability to facilitate the removal of blood clots) and/or vascularization 
(e.g., the promotion of vascularization). 

For TANGO 281, biological activities include, e.g., (1) the ability to modulate, 

20 e g <> stabilize, promote, inhibit or disrupt protein-protein interactions (e.g., homophilic 
and/or heterophilic), and protein-ligand interactions, e.g., in receptor-ligand recognition; 
(2) the ability to modulate cell-cell interactions; (3) the ability to modulate the host 
immune response; (4) the ability to modulate the proliferation, differentiation and/or 
activity of hematopoeitic cells (e.g. megakaryocytes); (5) the ability to modulate the 

25 development, differentiation, maturation, proliferation and/or activity of pulmonary 
system cells; (6) the ability to modulate the development, differentiation, maturation, 
proliferation and/or activity intestinal cells such as M cells; (7) the ability to modulate the 
development, differentiation, maturation, proliferation and/or activity of stomach cells 
such as cells of the gastric epithelium; (8) the ability to modulate intracellular signaling 

30 cascades (e.g., signal transduction cascades); and (9) the ability to modulate platelet 
function (e.g., the promotion of platelet aggregation). 

In one embodiment, a polypeptide of the invention has an amino acid sequence 
sufficiently identical to an identified domain of a polypeptide of the invention. As used 
herein, the term "sufficiently identical" refers to a first amino acid or nucleotide sequence 

35 which contains a sufficient or minimum number of identical or equivalent (e.g., with a 

similar side chain) amino acid residues or nucleotides to a second amino acid or nucleotide 
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sequence such that the first and second amino acid or nucleotide sequences have or encode 
a common structural domain and/or common functional activity. For example, amino acid 
or nucleotide sequences which contain or encode a common structural domain having 
about 60% identity, preferably 65% identity, more preferably 75%, 85%, 95%, 98% or 
5 more identity are defined herein as sufficiently identical. 

In one embodiment, a TANGO 253 protein includes at least one or more of the 
following domains: a signal sequence, a collagen domain and a Clq domain. 

In one embodiment, a TANGO 257 protein includes at least a signal peptide. 

In one embodiment, an INTERCEPT 258 includes at least one or more of the 
10 following domains: a signal sequence, an extracellular domain, an immunoglobulin (Ig) 
domain, a transmembrane domain, and an intracellular or cytoplasmic domain. 

In one embodiment, a TANGO 281 protein includes at least one or more of the 
following domains: a signal sequence, an extracellular domain, a photosystem II 10 kD 
phosphoprotein domain, a transmembrane domain, and an intracellular or cytoplasmic 
15 domain. 

The polypeptides of the present invention, or biologically active portions thereof, 
can be operably linked to a heterologous amino acid sequence to form fusion proteins. 
The invention further features antibodies, such as monoclonal or polyclonal antibodies, 
that specifically bind a polypeptide of the invention. In addition, the polypeptides of the 

20 invention or biologically active portions thereof can be incorporated into pharmaceutical 
compositions, which optionally include pharmaceutical^ acceptable carriers. 

In another aspect, the present invention provides methods for detecting the 
presence, activity or expression of a polypeptide of the invention in a biological sample by 
contacting the biological sample with an agent capable of detecting an indicator of the 

25 presence, activity or expression such that the presence activity or expression of a 
polypeptide of the invention is detected in the biological sample. 

In another aspect, the invention provides methods for modulating activity of a 
polypeptide of the invention comprising contacting a cell with an agent that modulates 
(inhibits or stimulates) the activity or expression of a polypeptide of the invention such 

30 that activity or expression in the cell is modulated. In one embodiment, the agent is an 
antibody that specifically binds to a polypeptide of the invention. 

In another embodiment, the agent modulates expression of a polypeptide of the 
invention by modulating transcription, splicing, or translation of an mRNA encoding a 
polypeptide of the invention. In yet another embodiment, the agent is a nucleic acid 

35 molecule having a nucleotide sequence that is antisense to the coding strand of an mRNA 
encoding a polypeptide of the invention. 
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The present invention also provides methods to treat a subject having a disorder 
characterized by aberrant activity of a polypeptide of the invention or aberrant expression 
of a nucleic acid of the invention by administering an agent which is a modulator of the 
activity of a polypeptide of the invention or a modulator of the expression of a nucleic acid 

5 of the invention to the subject. In one embodiment, the modulator is a protein of the 
invention. In another embodiment, the modulator is a nucleic acid of the invention. In 
other embodiments, the modulator is a peptide, peptidomimetic, or other small molecule. 

The present invention also provides diagnostic assays for identifying the presence 
or absence of a genetic lesion or mutation characterized by at least one of: (i) aberrant 

10 modification or mutation of a gene encoding a polypeptide of the invention; (ii) mis- 
regulation of a gene encoding a polypeptide of the invention; and (iii) aberrant post- 
translational modification of the invention wherein a wild-type form of the gene encodes a 
protein having the activity of the polypeptide of the invention. 

In another aspect, the invention provides a method for identifying a compound that 

15 binds to or modulates the activity of a polypeptide of the invention. In general, such 
methods entail measuring a biological activity of the polypeptide in the presence and 
absence of a test compound and identifying those compounds which alter the activity of 
the polypeptide. 

The invention also features methods for identifying a compound which modulates 
20 the expression of a polypeptide or nucleic acid of the invention by measuring the 
expression of the polypeptide or nucleic acid in the presence and absence of the 
compound. 

In another aspect, the invention provides substantially purified antibodies or 
fragments thereof, including human, humanized, chimeric and non-human antibodies or 

25 fragments thereof, which antibodies or fragments specifically bind to a polypeptide 

comprising an amino acid sequence of SEQ ED NO: 3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 
106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 
142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or the amino acid sequence 
encoded by the EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or 

30 EpTm281 cDNA insert of the plasmid deposited with the ATCC® as Accession Number 
207222, Accession Number 207215, Accession number 207217, Accession number 
207221, or patent deposit Number PTA-224. 

In another aspect, the invention provides substantially purified antibodies or 
fragments thereof, including, e.g., human, non-human, chimeric and humanized 

35 antibodies, which antibodies or fragments thereof specifically bind to a polypeptide 

comprising at least 15 contiguous amino acids of the amino acid sequence of SEQ ID NO: 
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3, 10, 17,23,28, 39,48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 
126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 
162 or 164, or the amino acid sequence encoded by the EpT253, EpTm253, EpT257, 
EpTm257, EpT258, EpTm258, EpT281 or EpTm281 cDNA insert of the plasmid 

5 deposited with the ATCC® as Accession Number 207222, Accession number 207215, 
Accession number 207217, Accession number 207221, or patent deposit number PTA- 
224, or a complement thereof. 

In another aspect, the invention provides substantially purified antibodies or 
fragments thereof, including, e.g., human, non-human, chimeric and humanized 

10 antibodies, which antibodies or fragments thereof specifically bind to a polypeptide 

comprising at least 95% identical to the amino acid sequence of SEQ ID NO: 3, 10, 17, 23, 
28, 39,48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or 
the amino acid sequence encoded by the EpT253, EpTm253, EpT257, EpTm257, EpT258, 

1 5 EpTm258, EpT28 1 or EpTm28 1 cDNA insert of the plasmid deposited with the ATCC® 
as Accession Number 207222, Accession number 207215, Accession number 207217, 
Accession number 207221 , or patent deposit number PTA-224, or a complement thereof. 

In another aspect, the invention provides substantially purified antibodies or 
fragments thereof, including, e.g., human, non-human, chimeric and humanized 

20 antibodies, which antibodies or fragments thereof specifically bind to a polypeptide 

encoded by a nucleic acid molecule which hybridizes to the nucleic acid molecule of SEQ 
ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 
104, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 
139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 

25 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 
188, 189, 190, 191 or 192 under conditions of hybridization of 6 X SSC at 45°C and 
washing in 0.2 X SSC, 0.1% SDS at 65 °C. 

Any of the antibodies of the invention can be conjugated to a therapeutic moiety or 
to a detectable substance. Non-limiting examples of detectable substances that can be 

30 conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 

fluorescent material, a luminescent material, a bioluminescent material, and a radioactive 
material. 

The invention also provides a kit containing an antibody of the invention 
conjugated to a detectable substance, and instructions for use. Still another aspect of the 
35 invention is a pharmaceutical composition comprising an antibody of the invention and a 
pharmaceutically acceptable carrier. In preferred embodiments, the pharmaceutical 
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composition contains an antibody of the invention, a therapeutic moiety, and a 
pharmaceutical^ acceptable carrier. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

5 

Brief Description of the Drawings 

FIGURES 1 A-AB depict the cDNA sequence of human TANGO 253 (SEQ ID 
NO:l) and the predicted amino acid sequence of human TANGO 253 (SEQ ID NO:3). 
The open reading frame of SEQ ED NO:l extends from nucleotide 188 to nucleotide 916 

1 0 of SEQ ID NO: 1 (SEQ ID NO:2). 

FIGURE 2 depicts a hydropathy plot of human TANGO 253. Relatively 
hydrophobic regions of the protein are above the dashed horizontal line, and relatively 
hydrophilic regions of the protein are below the dashed horizontal line. The cysteine 
residues (cys) are indicated by short vertical lines just below the hydropathy trace. The 

15 dashed vertical line separates the signal sequence (amino acids 1 to 15 of SEQ ID NO:3; 
SEQ ID NO: 5) on the left from the mature protein (amino acids 16 to 243 of SEQ ID 
NO:3; SEQ ED NO:4) on the right. Below the hydropathy plot, the amino acid sequence 
of human TANGO 253 is depicted. 

FIGURES 3A-3B depict a cDNA sequence of mouse TANGO 253 (SEQ ID NO:8) 

20 and the predicted amino acid sequences of mouse TANGO 253 (SEQ ID NO: 10). The 
open reading frame of SEQ ID NO: 10 extends from nucleotide 135 to 863 of SEQ ID 
NO:10(SEQIDNO:9). 

FIGURE 4 depicts a hydropathy plot of mouse TANGO 253. Relatively 
hydrophobic regions of the protein are shown above the dashed horizontal line, and 

25 relatively hydrophilic regions of the protein are below the dashed horizontal line. The 

cysteine residues (cys) are indicated by short vertical lines just below the hydropathy trace. 
The dashed vertical line separates the signal sequence (amino acids 1 to 15 of SEQ ED 
NO: 10; SEQ ID NO: 12) on the left from the mature protein (amino acids 16 to 243 of 
SEQ ID NO:10; SEQ ID NO:l 1) on the right. Below the hydropathy plot, the amino acid 

30 sequence of mouse TANGO 253 is depicted. 

FIGURE 5 depicts an alignment of the amino acid sequence of human TANGO 
253 (SEQ ID NO:3) and the amino acid sequence of mouse TANGO 253 (SEQ ED 
NO: 10). The alignment demonstrates that the amino acid sequences of human and mouse 
TANGO 253 are 93.8% identical. This alignment was performed using the ALIGN 

35 program with a PAM1 20 scoring matrix, a gap length penalty of 12 and a gap penalty of 4. 
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FIGURES 6A-6B depict alignments of the amino acid sequence of human 
adipocyte complement-mediated protein precursor (SEQ ID NO:20; Swiss Prot Accession 
Number Q15848) and the amino acid sequence of human TANGO 253 (SEQ ID NO:3; 
6A) or mouse TANGO 253 (SEQ ID NO: 10; 6B). 6A shows the amino acid sequences of 

5 human adipocyte complement-mediated protein precursor and human TANGO 253 are 
38.7% identical. 6B shows the amino acid sequences of human adipocyte complement- 
mediated precursor procursor protein and mouse TANGO 253 are 38.3% identical. These 
alignments were performed using the ALIGN alignment program with a PAM120 scoring 
matrix, a gap length penalty of 12, and a gap penalty of 4. 

10 FIGURES 7A-7C depict alignments of the nucleotide sequence of human 

adipocyte complement-mediated protein precursor (SEQ ED NO:32; GenBank Accession 
Number A1417523) and the nucleotide sequence of human TANGO 253 (SEQ ID NO:l) . 
The nucleotide sequences of human adipocyte complement-mediated protein precursor 
and human TANGO 253 are 29.1% identical. These alignments were performed using the 

15 ALIGN alignment program with a PAM120 scoring matrix, a gap length penalty of 12, 
and a gap penalty of 4. 

FIGURES 8A-8C depict alignments of the nucleotide sequence of human 
adipocyte complement-mediated protein precursor (SEQ ID NO:32; GenBank Accession 
Number A1417523) and the nucleotide sequence of mouse TANGO 253 (SEQ ID NO:8). 

20 The nucleotide sequences of human adipocyte complement-mediated protein precursor 
and mouse TANGO 253 are 30.4% identical. These alignments were performed using the 
ALIGN alignment program with a PAM120 scoring matrix, a gap length penalty of 12, 
and a gap penalty of 4. 

FIGURES 9A-9B depict the cDNA sequence of human TANGO 257 (SEQ ID 

25 NO: 15) and the predicted amino acid sequence of human TANGO 257 (SEQ ID NO: 17). 
The open reading frame of SEQ ID NO: 16 extends from nucleotide 88 to nucleotide 1305 
of SEQ ID NO: 15 (SEQ ID NO: 16). 

FIGURE 10 depicts a hydropathy plot of human TANGO 257. Relatively 
hydrophobic regions of the protein are shown above the dashed horizontal line, and 

30 relatively hydrophilic regions of the protein are below the dashed horizontal line. The 
cysteine residues (cys) and potential N-glycosylation sites (Ngly) are indicated by short 
vertical lines just below the hydropathy trace. The dashed vertical line separates the signal 
sequence (amino acids 1 to 21 of SEQ ID NO: 16; SEQ ID NO: 19) on the left from the 
mature protein (amino acids 22 to 406 of SEQ ID NO: 16; SEQ ID NO: 18) on the right. 

35 Below the hydropathy plot, the amino acid sequence of human TANGO 257 is depicted. 
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FIGURES 1 1 A-l IB depict a cDNA sequence of mouse TANGO 257 (SEQ ID 
NO:21) and the predicted amino acid sequence of mouse TANGO 257 (SEQ ID NO:23). 
The open reading frame of SEQ ID NO:21 extends from nucleotide 31 to 1248 of SEQ ID 
NO:21 (SEQ ID NO:22). 

5 FIGURE 12 depicts a hydropathy plot of mouse TANGO 257. Relatively 

hydrophobic regions of the protein are shown above the dashed horizontal line, and 
relatively hydrophilic regions of the protein are below the dashed horizontal line. The 
cysteine residues (cys) and potential N-glycosylation sites (Ngly) are indicated by short 
vertical lines just below the hydropathy trace. The dashed vertical line separates the signal 

10 sequence (amino acids 1 to 21 of SEQ ID NO:23; SEQ ID NO:25) on the left from the 
mature protein (amino acids 22 to 406 of SEQ ID NO:23; SEQ ID NO:24) on the right. 
Below the hydropathy plot, the amino acid sequence of mouse TANGO 257 is depicted. 

FIGURE 13 depicts an alignment of the amino acid sequence of human TANGO 
257 (SEQ ID NO: 17) and the amino acid sequence of mouse TANGO 257 (SEQ ID 

1 5 NO:23). This alignment demonstrates that the amino acid sequences of human and mouse 
TANGO 257 are 94.1% identical. This alignment was performed using the ALIGN 
program with a PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4. 

FIGURE 14 depicts an alignment of the amino acid sequence (SEQ ID NO:43) 
encoded by a nucleotide sequence referred to in PCT publication WO 98/39446 as "gene 

20 64", and the amino acid sequence of human TANGO 257 (SEQ ID NO: 1 7). Gene 64 
encodes a 353 amino acid residue protein that exhibits homology with the human 
extracellular molecule olfactomedin, which is though to be involved in maintenance, 
growth and/or differentiation of chemosensory cilia on the apical dendrites of olfactory 
neurons. The polypeptide encoded by gene 64 also exhibits homology to human TANGO 

25 257, which contains 406 amino acids (i.e., an additional 53 amino acids carboxy to residue 
353). The amino acid sequences of amino acid residues 1-353 of the gene 64-encoded 
polypeptide and human TANGO 257 are identical. As such, the overall amino acid 
sequence identity between the full length polypeptide encoded by gene 64, and the full- 
length human TANGO 257 polypeptide is approximately 87%. This alignment was 

30 performed using the ALIGN alignment program with a PAM120 scoring matrix, a gap 
length penalty of 12, and a gap penalty of 4. 

FIGURES 15 A- 15D depict an alignment of the nucleotide sequence of gene 64 
(SEQ ID NO:66; PCT Publication WO 98/39446) and the nucleotide sequence of human 
TANGO 257 (SEQ ID NO: 1 5). The nucleotide sequences of gene 64 and human 

35 TANGO 257 are 93.5% identical. It is noted, however, that among the differences 
between the two sequences is a cytosine nucleotide at human TANGO 257 (SEQ ID 
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NO: 15) position 1 146 that results in a human TANGO 257 amino acid sequence (SEQ ID 
NO: 17) of 406 amino acids as opposed to the gene 64 amino acid sequence of only 353 
amino acids (SEQ ID NO:43). Alignment of the nucleotide sequence of the gene 64 open 
reading frame and that of human TANGO 257 (SEQ ID NO: 16) show that the two 
5 nucleotide sequences are 87.2% identical. These alignments were performed using the 
ALIGN program with a PAM220 scoring matrix, a gap length penalty of 12 and a gap 
penalty of 4. 

FIGURE 16 depicts an alignment of the acid sequence of the gene 64-encoded 
polypeptide (SEQ ED NO:43) and the amino acid sequence of mouse TANGO 257 (SEQ 

10 ED NO:23). The sequences exhibit an overall amino acid sequence identity of 

approximately 81.8%. This alignment was performed using an ALIGN program with a 
PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4. 

FIGURE 17A-17C depicts an alignment of the nucleotide sequence of gene 64 
(SEQ ID NO:66) and the nucleotide sequence of mouse TANGO 257 (SEQ ED NO:21). 

15 The two sequences are approximately 76.2% identical. Alignment of the nucleotide 
sequence of the gene 64 open reading frame and that of mouse TANGO 257 (SEQ ID 
NO:22) show that the two nucleotide sequences are 77.8% identical. These alignments 
were performed using the ALIGN program with a PAM220 scoring matrix, a gap length 
penalty of 12 and a gap penalty of 4. 

20 FIGURES 18A-18B depict the cDNA sequence of human INTERCEPT 258 (SEQ 

ID NO:26) and the predicted amino acid sequence of INTERCEPT 258 (SEQ ID NO:28). 
The open reading frame of SEQ ID NO:26 extends from nucleotide 153 to nucleotide 
1262 of SEQ ID NO:26 (SEQ ID NO:27). 

FIGURE 19 depicts a hydropathy plot of human INTERCEPT 258. Relatively 

25 hydrophobic regions of the protein are above the dashed horizontal line, and relatively 
hydrophilic regions of the protein are below the dashed horizontal line. The cysteine 
residues (Cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. Below the hydropathy plot, the amino acid 
sequence of human INTERCEPT 258 is depicted. 

30 FIGURES 20A-20B depict a cDNA sequence of mouse INTERCEPT 258 (SEQ ID 

NO:37) and the predicted amino acid sequence of mouse INTERCEPT 258 (SEQ ID 
NO:39). The open reading frame of SEQ ID NO:37 extends from nucleotide 107 TO 1288 
of SEQ ID NO:60 (SEQ ID NO:38). 

FIGURE 21 depicts a hydropathy plot of mouse INTERCEPT 258. Relatively 

35 hydrophobic regions of the protein are shown above the dashed horizontal line, and 
relatively hydrophilic regions of the protein are below the dashed horizontal line. The 
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cysteine residues (cys) and potential N-glycosylation sites (Ngly) are indicated by short 
vertical lines just below the hydropathy trace. The dashed vertical line separates the signal 
sequence (amino acids 1 to 29 of SEQ ID NO:39; SEQ ID NO:41) on the left from the 
mature protein (amino acids 30 to 394 of SEQ ID NO:39; SEQ ID NO:40) on the right. 
5 Below the hydropathy plot, the amino acid sequence of mouse INTERCEPT 258 is 
depicted. 

FIGURE 22 depicts an alignment of the amino acid sequence of human 
INTERCEPT 258 (SEQ ID NO:28) and the amino acid sequence of mouse INTERCEPT 
258 (SEQ ID NO:39). The alignment demonstrates that the amino acid sequences of 
10 human and mouse INTERCEPT 258 are 62.8% identical. This alignment was performed 
using the ALIGN program with a PAM120 scoring matrix, a gap length penalty of 12 and 
a gap penalty of 4. 

FIGURE 23 depicts an alignment of the amino acid sequence of human A33 
antigen (SEQ ID NO:67; Swiss Prot Accession Number Q99795) and the amino acid 

1 5 sequence of human INTERCEPT 258 (SEQ ID NO:28). The A33 antigen is a 

transmembrane glycoprotein and member of the Ig superfamily that may be a cancer cell 
marker. The amino acid sequences of A33 antigen and human INTERCEPT 258 are 23% 
identical. This alignment was performed using the ALIGN alignment program with a 
PAM120 scoring matrix, a gap length penalty of 12, and a gap penalty of 4. 

20 FIGURES 24A-24D depict an alignment of the nucleotide sequence of human A33 

antigen (SEQ ED NO:68; Gen Bank Accession Number U79725) and the nucleotide 
sequence of human INTERCEPT 258 (SEQ ID NO:26). These two nucleotide sequences 
are 40.6% identical. The nucleotide sequence of the open reading frame of human A33 
antigen and that of human INTERCEPT 258 are 44% identical. These alignments were 

25 performed using the ALIGN alignment program with a PAM120 scoring matrix, a gap 
length penalty of 12, and a gap penalty of 4. 

FIGURE 25 depicts an alignment of the amino acid sequence of human A33 
antigen (SEQ ID NO:67; Swiss Prot Accession Number Q99795) and the amino acid 
sequence of mouse INTERCEPT 258 (SEQ ID NO:39). These two amino acid sequences 

30 have an overall amino acid identity of 23%. This alignment was performed using the 
ALIGN alignment program with a PAM120 scoring matrix, a gap length penalty of 12, 
and a gap penalty of 4. 

FIGURES 26A-26D depict an alignment of the nucleotide sequence of human A33 
antigen (SEQ ID NO:68; GenBank Accession Number U79725) and the nucleotide 

35 sequence of mouse INTERCEPT 258 (SEQ ID NO:37). These two nucleotide sequences 
are 40% identical. The nucleotide sequence of the open reading frame of human A33 
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antigen and that of mouse INTERCEPT 258 are 43.2% identical. These alignments were 
performed using the ALIGN alignment program with a PAM120 scoring matrix, a gap 
length penalty of 12, and a gap penalty of 4. 

FIGURE 27A-27E depict an alignment of the nucleotide sequence of human 

5 PECAM-1, an integrin expressed on endothelial cells (SEQ ID NO:72) and the nucleotide 
sequence of human INTERCEPT 258 (SEQ ID NO:26). These two nucleotide sequences 
are 40.5% identical. This alignment was performed using ALIGN alignment program 
with a PAM120 scoring matrix, a gap length of 12, and a gap penalty of 4. 

FIGURE 28A-28B depict the cDNA sequence of human TANGO 281 (SEQ ID 

10 NO:46) and the predicted amino acid sequence of human TANGO 281 (SEQ ID NO:48). 
The open reading frame of SEQ ED NO:66 extends from nucleotide 65 to nucleotide 799 
of SEQ ID NO:46 (SEQ ID NO:47). 

FIGURE 29 depicts a hydropathy plot of human TANGO 28 1 . Relatively 
hydrophobic regions of the protein are above the dashed horizontal line, and relatively 

15 hydrophilic regions of the protein are below the dashed horizontal line. The cysteine 
residues (cys) are indicated by short vertical lines just below the hydropathy trace. The 
dashed vertical line separates the signal sequence (amino acids 1 to 38 of SEQ ID NO:48; 
SEQ ID NO:49) on the left from the mature protein (amino acids 39 to 245 of SEQ ID 
NO:48; SEQ ID NO:50) on the right. Below the hydropathy plot, the amino acid sequence 

20 of human TANGO 28 1 is depicted. 

FIGURE 30 depicts an alignment of the amino acid sequence of photosystem II 10 
kD phosphoprotein domain (SEQ ID NO:69; GenBank Accession Number PF00737) and 
the amino acid sequence 97 to 146 of human TANGO 281 (SEQ ID NO:48). This 
alignment was performed using the ALIGN alignment program with a PAM120 scoring 

25 matrix, a gap length penalty of 12, and a gap penalty of 4. 

FIGURES 31A-31B depict the cDNA sequence of mouse TANGO 281 (SEQ ID 
NO:56) and the predicted amino acid sequence of mouse TANGO 281 (SEQ ID NO:58). 
The open reading frame of SEQ ID NO:56 extends from nucleotide 90 to nucleotide 728 
of SEQ ID NO:56 (SEQ ID NO:57). 

30 Figure 32 depicts a hydropathy plot of mouse TANGO 28 1 . Relatively 

hydrophobic regions of the protein are above the dashed horizontal line, and relatively 
hydrophilic regions of the protein are below the dashed horizontal line. The cysteine 
residues (cys) are indicated by short vertical lines just below the hydropathy trace. The 
dashed vertical line separates the signal sequence (amino acids 1 to 26 of SEQ ID NO:58; 

35 SEQ ID NO:59) on the left from the mature protein (amino acids 27 to 213 of SEQ ID 
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NO: 58; SEQ ID NO:60) on the right. Below the hydropathy plot, the amino acid sequence 
of mouse TANGO 28 1 is depicted. 

FIGURE 33 depicts an alignment of the amino acid sequence of human TANGO 
281 (SEQ ED NO:48) and the amino acid sequence of mouse TANGO 281 (SEQ ID 
5 NO:58). The alignment demonstrates that the amino acid sequences of human and mouse 
TANGO 28 1 are 66.5% identical. This alignment was performed using the ALIGN 
program with a PAM120 scoring matrix, a gap length penalty of 12 and a gap penalty of 4. 

Detailed Description of the Invention 

10 The TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 proteins and 

nucleic acid molecules comprise families of molecules having certain conserved structural 
and functional features. As used herein, the terms "family" or "families" are intended to 
mean two or more proteins or nucleic acid molecules having a common structural domain 
and having sufficient amino acid or nucleotide sequence identity as defined herein. 

15 Family members can be from either the same or different species. For example, a family 
can comprises two or more proteins of human origin, or can comprise one or more 
proteins of human origin and one or more of non-human origin. Members of the same 
family may also have common structural domains. 

For example, TANGO 253 proteins, TANGO 257 proteins, INTERCEPT 258 

20 proteins and TANGO 281 proteins of the invention have signal sequences. As used 
herein, a "signal sequence" includes a peptide of at least about 15 or 20 amino acid 
residues in length which occurs at the N-terrninus of secretory and membrane-bound 
proteins and which contains at least about 70% hydrophobic amino acid residues such as 
alanine, leucine, isoleucine, phenylalanine, proline, tyrosine, tryptophan, or valine. In a 

25 preferred embodiment, a signal sequence contains at least about 10 to 40 amino acid 

residues, preferably about 19-34 amino acid residues, and has at least about 60-80%, more 
preferably 65-75%, and more preferably at least about 70% hydrophobic residues. A 
signal sequence serves to direct a protein containing such a sequence to a lipid bilayer. 
Thus, in one embodiment, a TANGO 253 protein contains a signal sequence of about 

30 amino acids 1 to 15 of SEQ ED NO:3 (SEQ ID NO:5) or about amino acids 1 to 15 of SEQ 
ID NO: 10 (SEQ ID NO: 12). In another embodiment, a TANGO 257 protein contains a 
signal sequence of about amino acids 1 to 21 of SEQ ID NO: 17 (SEQ ID NO: 19) or about 
amino acids 1 to 21 of SEQ ID NO:23 (SEQ ID NO:25). In another embodiment, an 
INTERCEPT 258 protein contains a signal sequence at about amino acids 1 to 29 of SEQ 

35 ID NO:28 (SEQ ID NO:30) or about amino acids 1 to 29 of SEQ ID NO:39 (SEQ ID 

NO:41). In yet another embodiment, a TANGO 281 protein contains a signal sequence of 
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about amino acids 1 to 38 of SEQ ED NO:48 (SEQ ID NO:49) or about amino acids 1 to 
26 of SEQ ID NO:58 (SEQ ED NO:59). The signal sequence is cleaved during processing 
of the mature protein. 

In one embodiment, TANGO 253 includes at least one RGD cell attachment site. 

5 An RGD domain contains a contiguous arginine-glycine-aspartic acid amino acid 
sequence and is involved in cell-cell, cell-extracellular matrix and cell adhesion 
interactions. In a preferred embodiment, a TANGO 253 family member has the amino 
acid sequence of SEQ ID NO:3 and, preferably, a RGD cell attachment site is located at 
about amino acid positions 77 to 79. 

10 TANGO 253 family members can also include a collagen domain. As used herein, 

the term "collagen domain" refers to a protein domain containing a G-X-Y amino acid 
repeat motif, wherein the first amino acid residue is glycine and the second and third 
amino acid residues can be any residue but are preferably proline or hydroxyproline. 
Typically, a collagen domain contains at least about 3 to 5 G-X-Y repeats, and can contain 

15 about 3, 5, 8, 10, 12, 15, 20 or more continuous G-X-Y repeats. In one embodiment, a 
collagen domain can fold to form a triple helical structure. 

In one embodiment, a TANGO 253 family member includes at least one collagen 
domain having an amino acid sequence that is at least about 40%, 50%, 60%, 70%, 80%, 
90%, 95% or 98% identical to amino acids 36 to 95 of SEQ ID NO:3, which is the 

20 collagen domain of human TANGO 253 (SEQ ID NO:6), or amino acids 36 to 95 of SEQ 
ID NO: 10, which is the collagen domain of mouse TANGO 253 (SEQ ID NO: 13), while 
maintaining a glycine residue at the first position of G-X-Y repeats within the domain to 
maintain at least 3, 5, 8, 1 0, 12, 15 or 20 contiguous G-X-Y repeats, or while most 
preferably maintaining a glycine repeat at the first position of each G-X-Y repeat within 

25 the domain. 

TANGO 253 family members can also include a Clq domain or at least one of the 
conserved amino acid motifs found therein. As used herein, the term "Clq domain" refers 
to a protein domain that bears homology to a Clq domain present within a member of the 
CI enzyme complex. A Clq domain typically includes about 130-140 amino acid 

30 residues. Clq domains are utilized in processes involving, e.g., correct protein folding and 
alignment and protein-protein interactions. 

In one embodiment, a TANGO 253 family member includes one or more Clq 
domains having an amino acid sequence that is at least 45%, preferably about 50%, 55%, 
60%, 70%, 75%, 80%, 90%, 95% and most preferably at least about 98% identical to 

35 amino acids 105 to 232 of SEQ ID NO:3, which is the human TANGO 253 Clq domain 
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(SEQ P NO:7) or amino acids 1 05 to 232 of SEQ ID NO: 10, which is the mouse 
TANGO 253 Clq domain (SEQ ID NO: 14). 

Embodiments of TANGO 253 family members include, but are not limited to, 
human, mouse and rat TANGO 253 nucleic acids and proteins. The features of the human 

5 and mouse TANGO 253 are described below. A cDNA encoding a rat TANGO 253 
nucleotide sequence (SEQ ED NO:74), identified in clone jtrxaOOl el Otl, is 75.4% 
identical to human TANGO 253 (SEQ ID NO:l) in a 536 bp overlap. Further, the isolated 
rat TANGO 253 nucleotide sequence (SEQ ID NO:74) is 86% identical to mouse TANGO 
253 (SEQ ID NO:9) in a 472 bp overlap. 

10 Embodiments of TANGO 257 family members include, but are not limited to, 

human, mouse and rat TANGO 257 nucleic acids and proteins. The features of the human 
and mouse TANGO 257 are described below. A cDNA encoding a rat TANGO 257 
nucleotide sequence (SEQ ID NO:75), identified within clone jtrxal02g06tl, is 83.8% 
identical to human TANGO 257 (SEQ ID NO: 15) in a 734 bp overlap. Further, the 

1 5 isolated rat TANGO 257 nucleotide sequence (SEQ ID NO:75) is 88.4% identical to 
mouse TANGO 257 (SEQ ID NO:21) in a 731 bp overlap. 

In one example, a TANGO 257 family member includes one or more of the 
following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a 
cytoplasmic domain. In one embodiment, a TANGO 257 protein contains cytoplasmic 

20 domains of about amino residues 1 to 202 of SEQ ID NO:17 (SEQ ED NO:84) and about 
amino acid residues 338 to 406 of SEQ ID NO:17 (SEQ ID NO:92), transmembrane 
domains of about amino acid residues 203 to 221 of SEQ ID NO:17 (SEQ ID NO:86) and 
about amino acid residues 321 to 337 of SEQ ID NO:17 (SEQ ED NO:87), and an 
extracellular domain of about amino acid residues 222 to 320 of SEQ ID NO: 17 (SEQ ID 

25 NO:88). In an alternative embodiment, a TANGO 257 protein contains an extracellular 
domain of about amino acid residues 1 to 320 of SEQ ID NO: 17 (SEQ ID NO:89) ora 
mature extracellular domain of about amino acid residues 22 to 320 of SEQ ID NO: 17 
(SEQ ID NO:90), a transmembrane domain of about amino acid residues 321 to 337 of 
SEQ ID NO: 17 (SEQ ID NO:87), and a cytoplasmic domain of about amino acid residues 

30 338 to 406 of SEQ ID NO:l 7 (SEQ ID NO:92). In another embodiment, a mature 
TANGO 257 protein contains about amino acid residues 22 to 406 of SEQ ID NO: 17 
(SEQ ID NO: 18). 

In another embodiment, a TANGO 257 protein contains intracellular domains of 
about amino acid residues 1 to 202 of SEQ ID NO:23 (SEQ ID NO:93) and about amino 
35 acid residues 338 to 406 of SEQ ID NO:23 (SEQ ID NO:94), transmembrane domains of 
about amino acid residues 203 to 221 of SEQ ID NO:23 (SEQ ID NO:95) and about 
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amino acid residues 321 to 337 of SEQ ID NO:32 (SEQ ID NO:96), and an extracellular 
domain of about amino acid residues 222 to 320 of SEQ ID NO:23 (SEQ ID NO:97). In 
alternative embodiment, a TANGO 257 protein contains an extracellular domain of about 
amino acid residues 1 to 320 of SEQ ID NO:23 (SEQ ID NO:98) or a mature extracellular 

5 domain of about amino acid residues 22 to 320 of SEQ ID NO:23 (SEQ ID NO:99), a 
transmembrane domain of about amino acid residues 321 to 337 of SEQ ID NO:25 (SEQ 
ID NO:96), and an intracellular domain of about amino acid residues 338 to 406 of SEQ 
ID NO:23 (SEQ ID NO:94). In another embodiment, a mature TANGO 257 protein 
contains about amino acid residues 22 to 406 of SEQ ED NO:23 (SEQ ID NO:24). 

1 o In another example, an INTERCEPT 258 family member includes one or more of 

the following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) 
a cytoplasmic domain. Thus, in one embodiment, an INTERCEPT 258 protein contains 
extracellular domains of about amino acid residues 1 to 206 of SEQ ID NO:28 (SEQ ID 
NO:8 1) or about amino acid residues 30 to 206 of SEQ ID NO: 28 (SEQ ID NO:76) and 

15 about amino acid residues 272 to 370 of SEQ ID NO: 28 (SEQ ID NO:34), 

transmembrane domains of about amino acid residues 207 to 224 of SEQ ID NO:28 (SEQ 
ID NO:78) and about amino acid residues 247 to 271 of SEQ ID NO:28 (SEQ ID NO:33), 
and a cytoplasmic domain of about amino acid residues 225 to 246 of SEQ ID NO:28 
(SEQ ID NO:79). In an alternative embodiment, an INTERCEPT 258 protein contains an 

20 extracellular domain of about amino acid residues 272 to 370 of SEQ ID NO:28 (SEQ ID 
NO:34), a transmembrane domain of about amino acid residues 247 to 271 of SEQ ID 
NO:28 (SEQ ED NO:33), and a cytoplasmic domain of about amino acid residues 1 to 246 
of SEQ ID NO:28 (SEQ ID NO:31) or a mature cytoplasmic domain of about amino acid 
residues 30 to 246 of SEQ ID NO:28 (SEQ ID NO:82). In accordance with these 

25 embodiments, an INTERCEPT 258 protein is a mature protein containing an extracellular, 
transmembrane and cytoplasmic domain of about amino acids 30 to 370 of SEQ ID NO:28 
(SEQIDNO:29). 

In another embodiment, an INTERCEPT 258 protein contains an extracellular 
domain of about amino acids 1 to 249 of SEQ ID NO:39 (SEQ ID NO:42), or a mature 

30 extracellular domain of about amino acids 30 to 249 of SEQ ID NO:39 (SEQ ID NO:83). 
In another embodiment, an INTERCEPT 258 protein contains a transmembrane domain of 
about amino acids 250 to 274 of SEQ ID NO:39 (SEQ ID NO:44). In another 
embodiment, an INTERCEPT 258 protein contains a cytoplasmic domain of about amino 
acids 275 to 394 of SEQ ED NO:39 (SEQ ID NO:45). In accordance with these 

35 embodiments, an INTERCEPT 258 protein is a mature protein containing an extracellular, 
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transmembrane and cytoplasmic domain of about 30 to 394 of SEQ ED NO:39 (SEQ ID 
NO:40). 

INTERCEPT 258 family members can also include an immunoglobulin (Ig) 
domain contained within the extracellular domain. As used herein, the term "Ig domain" 

5 refers to a protein domain bearing homology to immunoglobulin superfamily members. 
An Ig domain includes about 30-90 amino acid residues, preferably about 40-80 amino 
acid residues, more preferably about 50-70 amino acid residues, still more preferably 
about 55-65 amino acid residues, and most preferably about 57 to 59 amino acid residues. 
In certain embodiments, an Ig domain contains a conserved cysteine residue within about 

10 5 to 15 amino acid residues, preferably about 7 to 12 amino acid residues, and most 
preferably about 8 amino acid residues from its N-terminal end, and another conserved 
cysteine residue within about 1 to 5 amino acid residues, preferably about 2 to 4 amino 
acid residues, and most preferably about 3 amino acid residues from its C-terminal end. 
An Ig domain typically has the following consensus sequence, beginning about 1 

15 to 15 amino acid residues, more preferably about 3 to 10 amino acid residues, and most 
preferably about 5 amino acid residues from the C terminal end of the domain: (FY)-Xaa- 
C-Xaa-(VA)-COO-, wherein (FY) is either a phenylalanine or a tyrosine residue 
(preferably tyrosine), where "Xaa" is any amino acid, C is a cysteine residue, (VA) is 
either a valine or an alanine residue (preferably alanine), and COO- is the protein C 

20 terminus. 

In one embodiment, an INTERCEPT 258 family member includes one or more Ig 
domains having an amino acid sequence that is at least about 55%, preferably at least 
about 65%, more preferably at least 75%, yet more preferably at least about 85%, and 
most preferably at least about 95% identical to amino acids 49 to 128 and/or amino acids 

25 167 to 226 of SEQ ID NO:28, which are the Ig domains of human INTERCEPT 258 
(these Ig domains are also represented as SEQ ID NO:35 and 36, respectively). 

In another embodiment, an INTERCEPT 258 family member includes one or more 
Ig domains having an amino acid sequence that is at least about 55%, preferably at least 
about 65%, more preferably at least about 75%, yet more preferably at least about 85%, 

30 and most preferably at least about 95% identical to amino acids 167 to 226 of SEQ ID 
NO:28 (SEQ ID NO:36), includes a conserved cysteine residue about 8 residues 
downstream from the N-terminus of the Ig domain, and has one or more Ig domain 
consensus sequences described herein. In another embodiment, an INTERCEPT 258 
family member includes one or more Ig domains having an amino acid sequence that is at 

35 least 55%, preferably at least about 65%, more preferably at least about 75%, yet more 
preferably at least about 85%, and most preferably at least about 95% identical to amino 
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acids 167 to 226 of SEQ ID NO:28 (SEQ ID NO:36), includes a conserved cysteine 
residue 8 residues downstream from the N-terminus of the Ig domain, has one or more Ig 
domain consensus sequences described herein, and has a conserved cysteine within the 
consensus sequence that forms a disulfide both with said first conserved cysteine. In yet 

5 another embodiment, an INTERCEPT 258 family member includes one or more Ig 

domains having an amino acid sequence that is at least 55%, preferably at least about 65%, 
more preferably at least about 75%, yet more preferably at least about 85%, and most 
preferably at least about 95% identical to amino acids 167 to 226 of SEQ ID NO:28 (SEQ 
ID NO:36), includes a conserved cysteine residue 8 residues downstream from the N- 

10 terminus of the Ig domain, has one or more Ig domain consensus sequences described 
herein, has a conserved cysteine within the consensus sequence that forms a disulfide both 
with said first conserved cysteine, and has at least one INTERCEPT 258 biological 
activity as described herein. 

In a preferred embodiment, an INTERCEPT 258 family member has the amino 

1 5 acid sequence of SEQ ID NO:28 wherein the aforementioned Ig conserved residues are 
located as follows: the N-terminal conserved cysteine residue is located at about amino 
acid position 174 (within the Ig domain SEQ ID NO:36) and the C-terminal conserved 
cysteine is located at about amino acid position 224 (within the Ig domain SEQ ID 
NO:36). 

20 In another embodiment, an INTERCEPT 258 family member includes one or more 

Ig domains having an amino acid sequence that is at least about 55%, preferably at least 
about 65%, more preferably at least about 75%, yet more preferably at least about 85%, 
and most preferably at least about 95% identical to amino acids 170 to 229 of SEQ ID 
NO:39, which is the Ig domain of mouse INTERCEPT 258 (SEQ ID NO:71). In another 

25 embodiment, an INTERCEPT 258 family member includes one or more Ig domains 
having an amino acid sequence that is at least about 55%, preferably at least about 65%, 
more preferably at least about 75%, yet more preferably at least about 85%, and most 
preferably at least about 95% identical to amino acids 170 to 229 of SEQ ID NO:39 (SEQ 
ID NO:71), includes a conserved cysteine residue about 8 residues downstream from the 

30 N-terminus of the Ig domain, and has one or more Ig domain consensus sequences 
described herein, has a conserved cysteine within the consensus sequence that forms a 
disulfide both with said first conserved cysteine, and has at least one INTERCEPT 258 
biological activity as described herein. 

In a preferred embodiment, an INTERCEPT 258 family member has the amino 

35 acid sequence of SEQ ID NO:39 wherein the aforementioned Ig domain conserved 

residues are located as follows: the N-terminal conserved cysteine residue is located at 
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about amino acid residue position 177 (within the Ig domain SEQ ID NO:71) and the C- 
terminal conserved cysteine residue is located at about amino acid position 227 (within the 
Ig domain SEQIDNO:71). 

In another example, a TANGO 281 family member consists of one or more of the 

5 following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a 
cytoplasmic domain. In one embodiment, a TANGO 281 protein contains an extracellular 
domain at amino acids 1 to about 123 of SEQ ED NO:48 or a mature extracellular domain 
at about amino acid residues 39 to 123 of SEQ ID NO:48 (SEQ ID NO:51), a 
transmembrane domain at about amino acid residues 124 to 148 of SEQ ID NO:48 (SEQ 

10 ID NO:52), and a cytoplasmic domain at about amino acid residues 149 to 245 of SEQ ID 
NO:48 (SEQ ID NO:53). In another embodiment, a mature TANGO 281 protein contains 
about amino acid residues 39 to 245 of SEQ ID NO: 48 (SEQ ID NO: 50). In another 
embodiment, a TANGO 281 family contains an extracellular domain at amino acids 1 to 
about 112 of SEQ ID NO:58 or a mature extracellular domain at about amino acid residues 

15 27 to 1 12 of SEQ ID NO:58 (SEQ ID NO:6 1), a transmembrane domain at about amino 
acid residues 1 13 to 137 of SEQ ID NO:78 (SEQ ID NO:62), and a cytoplasmic domain at 
about amino acid residues 138 to 213 of SEQ ID NO:78 (SEQ ID NO:63). In yet another 
embodiment, a mature TANGO 281 protein contains about amino acid residues 27 to 21 3 
of SEQ ID NO: 58 (SEQ ID NO: 61). 

20 In one embodiment, a TANGO 281 family member includes a signal sequence. In 

a preferred embodiment, a TANGO 281 family member has the amino acid sequence of 
SEQ ID NO:48, and the signal sequence is located at about amino acids 1 to 38. In an 
another preferred embodiment, a TANGO 281 family member has the amino acid 
sequence of SEQ ID NO:58, and the signal sequence is located at about amino acids 1 to 

25 26. 

A photosystem II lOkd phosphoprotein (PSBH) domain has been identified in the 
TANGO 281 proteins. The domain is also present in the chloroplast gene PSBH that 
encodes a 9-10kDa thylakoid membrane protein (PSII-H) which is associated with 
photosystem II. In one embodiment, a TANGO 281 family member includes one or more 

30 PSBH domains having an amino acid sequence that is at least about 55%, preferably at 
least about 65%, more preferably at least 75%, yet more preferably at least about 85%, and 
most preferably at least about 95% identical to amino acids 41 to 90 and/or amino acids 
127 to 182 of SEQ ID NO:48, which are the PSBH domains of human TANGO 281 (these 
PSBH domains are also represented as SEQ ID NO:54 and 55, respectively). In another 

35 embodiment, a TANGO 281 family member includes one or more PSBH domains having 
an amino acid sequence that is at least about 55%, preferably at least about 65%, more 
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preferably at least about 75%, yet more preferably at least about 85%, and most preferably 
at least about 95% identical to amino acids 41 to 90 and/or amino acids 127 to 182 of SEQ 
ID NO:48, which are the PSBH domains of human TANGO 281 (these PSBH domains are 
also represented as SEQ ID NO:54 and 55, respectively), includes one or more PSBH 

5 domain consensus sequences described herein, and has at least one TANGO 281 
biological activity as described herein. 

In another embodiment, a TANGO 281 family member includes one or more 
PSBH domains having an amino acid sequence that is at least about 55%, preferably at 
least about 65%, more preferably at least 75%, yet more preferably at least about 85%, and 

10 most preferably at least about 95% to 98% identical to amino acids 42 to 91 and/or amino 
acids 128 to 183 of SEQ ID NO:58, which are the PSBH domains of mouse TANGO 281 
(these PSBH domains are also represented as SEQ ID NO:64 and 65, respectively). In 
another embodiment, a TANGO 281 family member includes one or more PSBH domains 
having an amino acid sequence that is at least about 55%, preferably at least about 65%, 

15 more preferably at least about 75%, yet more preferably at least about 85%, and most 
preferably at least about 95% identical to amino acids 42 to 91 and/or amino acids 128 to 
183 of SEQ ID NO:58, which are the PSBH domains of mouse TANGO 281 (these PSBH 
domains are also represented as SEQ ED NO:64 and 65, respectively), includes one or 
more PSBH domain consensus sequences described herein, and has at least one TANGO 

20 281 biological activity as described herein. 

Various features of human and mouse TANGO 253, TANGO 257, INTERCEPT 
258 and TANGO 281 are summarized below. 

Human TANGO 253 

25 A cDNA encoding human TANGO 253 was identified by analyzing the sequences 

of clones present in a coronary artery smooth muscle library for sequences that encode 
secreted proteins. The primary cells utilized in construction of the library had been 
stimulated with agents that included phorbol 12-myristate 13-acetate (PMA), tumor 
neurosis factor (TNF), ionomycin, and cyclohexamide (CHX). This analysis led to the 

30 identification of a clone, Athma27h9, encoding full-length human TANGO 253. The 
human TANGO 253 cDNA of this clone is 1339 nucleotides long (Figures 1A-1B; SEQ 
ID NO:l). The open reading frame of this cDNA, nucleotides 188 to 916 of SEQ ID NO:l 
(SEQ ID NO:2), encodes a 243 amino acid secreted protein (Figures 1A-1B; SEQ ID 
NO:3). 

35 Figure 2 depicts a hydropathy plot of human TANGO 253. Relatively 

hydrophobic regions of the protein are shown above the horizontal line, and relatively 
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hydrophilic regions of the protein are below the horizontal line. The cysteine residues 
(cys) are indicated by short vertical lines just below the hydropathy trace. The dashed 
vertical line separates the signal sequence (amino acids 1 to 15 of SEQ ID NO:3; SEQ ED 
NO:5) on the left from the mature protein (amino acids 15 to 243 of SEQ ID NO:3; SEQ 

5 ID NO:4) on the right. 

The signal peptide prediction program SIGNALP (Nielsen et al, 1997, Protein 
Engineering 10:1-6) predicted that human TANGO 253 includes a 15 amino acid signal 
peptide (amino acid 1 to amino acid 15 of SEQ ID NO:3; SEQ ED NO:5) preceding the 
mature human TANGO 253 protein (corresponding to amino acid 16 to amino acid 243 of 

10 SEQIDNO:3; SEQIDNO:4). The molecular weight of TANGO 253 protein without 
post-translational modifications is 25.3 kDa prior to the cleavage of the signal peptide, 
23.8 kDa after cleavage of the signal peptide. 

Human TANGO 253 includes a collagen domain (at about amino acids 36 to 95 of 
SEQ ID NO:3; SEQ ID NO:6) and a CI q domain (at about amino acids 105 to 232 of SEQ 

1 5 ID NO:3; SEQ ID NO:7) containing 23 G-X-Y repeats. An RGD cell attachment site is 
found at amino acids 77 to 79 of SEQ ID NO:3. 

Three protein kinase C phosphorylation sites are present in human TANGO 253. 
The first has the sequence SAK (at amino acids 107 to 109 of SEQ ID NO:3), the second 
has the sequence TGK (at amino acids 140 to 142 of SEQ ID NO:3), and the third has the 

20 sequence SIK (at amino acids 220 to 222 of SEQ ID NO:3). Human TANGO 253 has 
three N-myristylation sites. The first has the sequence GLAAGS (at amino acids 1 1 to 16 
of SEQ ID NO:3), the second has the sequence GGRPGL (at amino acids 68 to 73 of SEQ 
ED NO:3) and the third has the sequence GIYASI (at amino acids 216 to 221 of SEQ ID 
NO:3). 

25 Northern analysis of human TANGO 253 expression demonstrates strong 

expression in heart, lung, liver, kidney and pancreas, and moderate expression in brain, 
placenta and skeletal muscle. Liver expression reveals two human TANGO mRNA bands, 
one of approximately 1.3kb (which is the size observed in the other tissues) as well as a 
band at approximately lkb, which may be the result of an alternative splicing event. 

30 Secretion assays reveal a human TANGO 253 protein of approximately 30kDa. 

The secretion assays were performed as follows: 8xl0 5 2 93T cells were plated per well in 
a 6- well plate and the cells were incubated in growth medium (DMEM, 10% fetal bovine 
serum, penicillin/strepomycin) at 37 °C, 5% C0 2 overnight. 293T cells were transfected 
with 2 of full-length TANGO 253 inserted in the pMET7 vector/well and 10 \ig 

35 LipofectAMINE (GEBCO/BRL Cat. U 1 8324-0 1 2) /well according to the protocol for 
GIBCO/BRL LipofectAMfNE. The transfectant was removed 5 hours later and fresh 
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growth medium was added to allow the cells to recover overnight. The medium was 
removed and each well was gently washed twice with DMEM without methionine and 
cysteine (ICN Cat. # 16-424-54). 1 ml DMEM without methionine and cysteine with 50 
nCi Trans- 35 S (ICN Cat. # 51006) was added to each well and the cells were incubated at 

5 37 °C, 5% C0 2 for the appropriate time period. A 150 nl aliquot of conditioned medium 
was obtained and 150 jjlI of 2X SDS sample buffer was added to the aliquot. The sample 
was heat-inactivated and loaded on a 4-20% SDS-PAGE gel. The gel was fixed and the 
presence of secreted protein was detected by autoradiography. 

TANGO 253 exhibits homology to an adipocyte complement-mediated protein 

10 precursor and so may be involved in adipocyte function, e.g., may act as a signaling 

molecule for adipocyte tissue. Figure 6A shows an alignment of the human TANGO 253 
amino acid sequence (SEQ ID NO: 3) with the human adipocyte complement-mediated 
protein precursor amino acid sequence (SEQ ID NO:20). The alignment shows that there 
is a 38.7% overall amino acid sequence identity between human TANGO 253 and human 

1 5 adipocyte complement-mediated protein precursor. 

Figures 7A-7C shows an alignment of the nucleotide sequence of human adipocyte 
complement-mediated protein precursor nucleotide sequence (SEQ ID NO:32); GenBank 
Accession Number A1417523) and the nucleotide sequence of human TANGO 253 (SEQ 
ID NO: 1). The alignment shows a 29. 1% overall sequence identity between the two 

20 nucleotide sequences. 

The human TANGO 253 nucleotide sequence was mapped to human chromosome 
1 1, between flanking markers Dl 1S1356 and Dl 1S924 using the Genebridge 4 Human 
Radiation hybrid mapping panel with CAAAGTGAGCTCATGCTCTCAC (SEQ ID 
NO: 193) as the forward primer and CTCTGGTCTTGGGCAGAAATC (SEQ ID NO: 194) 

25 as the reverse primer. 

Clone EpT253, which encodes human TANGO 253, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 
2209) on April 21, 1999 and assigned Accession Number 207222. This deposit will be 
maintained under the terms of the Budapest Treaty on the International Recognition of the 

30 Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

Mouse TANGO 253 

35 A cDNA encoding mouse TANGO 253 was identified by analyzing the sequences 

of clones present in a mouse microglia library using a rat TANGO 253 probe from sciatic 
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nerve. This analysis led to the identification of a clone, AtmXalel075, encoding full- 
length mouse TANGO 253. The mouse TANGO 253 cDNA of this clone is 1263 
nucleotides long (Figures 3 A-3B; SEQ ID NO:8). The open reading frame of this cDNA, 
nucleotides 135 to 863 of SEQ ED NO:8 (SEQ ID NO:9), encodes a 243 amino acid 

5 secreted protein (Figures 3 A-3B; SEQ ID NO: 10). 

Figure 4 depicts a hydropathy plot of mouse TANGO 253. Relatively hydrophobic 
regions of the protein are shown above the horizontal line, and relatively hydrophilic 
regions of the protein are below the horizontal line. The cysteine residues (cys) are 
indicated by short vertical lines just below the hydropathy trace. The dashed vertical line 

10 separates the signal sequence (amino acid 1 to amino acid 15 of SEQ ID NO: 10; SEQ ID 
NO: 12) on the left from the mature protein (amino acid 16 to amino acid 243 of SEQ ID 
NO:10; SEQ ID NO:l 1) on the right. 

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 
Engineering 10:1-6) predicted that mouse TANGO 253 includes a 15 amino acid signal 

15 peptide (amino acid 1 to amino acid 15 of SEQ ID NO:10; SEQ ID NO:12) preceding the 
mature mouse TANGO 253 protein (corresponding to amino acid 16 to amino acid 243 of 
SEQ IDNO:10; SEQ IDNO:ll). The molecular weight of mouse TANGO 253 protein 
without post-translational modifications is 25.4 kDa prior to the cleavage of the signal 
peptide, 23.9 kDa after cleavage of the signal peptide. 

20 Mouse TANGO 253 includes a collagen domain (at amino acids 36 to 95 of SEQ 

ID NO:10; SEQ ID NO:13) and a Clq domain (at amino acids 105-232 of SEQ ID NO:10; 
SEQ ID NO: 14). 

Three protein kinase C phosphorylation sites are present in mouse TANGO 253. 
The first has the sequence SAK (at amino acids 107 to 109 of SEQ ID NO: 10), the second 

25 has the sequence TGK (at amino acids 140 to 142 of SEQ ID NO: 1 0), and the third has the 
sequence SIK (at amino acids 220 to 222 of SEQ ID NO: 10). Mouse TANGO 253 has 
four N-myristylation sites. The first has the sequence GLVSGS (at amino acids 1 1 to 16 
of SEQ ID NO: 10), the second has the sequence GGRPGL (at amino acids 68 to 73 of 
SEQ ID NO: 10), the third has the sequence GQSIAS (at amino acids 172 to 177 of SEQ 

30 ID NO:10), and the fourth has the sequence GIYASI (at amino acids 216 to 221 of SEQ 
ID NO: 10). 

As shown in Figure 5, human TANGO 253 protein and mouse TANGO 253 
protein are 93.8% identical. Figure 6B shows an alignment of the mouse TANGO 253 
amino acid sequence (SEQ ID NO: 10) with the human adipocyte complement-mediated 
35 protein precursor amino acid sequence (SEQ ID NO:20). The alignment shows that there 
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is a 38.3% overall amino acid sequence identity between mouse TANGO 253 and human 
adipocyte complement-mediated protein precursor. 

Figures 8A-8C shows an alignment of the nucleotide sequence of human adipocyte 
complement-mediated protein precursor nucleotide sequence (SEQ ID NO:32); GenBank 

5 Accession Number A 141 7523) and the nucleotide sequence of mouse TANGO 253 (SEQ 
ID NO:8). The alignment shows a 30.4% overall sequence identity between the two 
nucleotide sequences. 

In situ tissue screening was performed on mouse embryonic tissue (obtained from 
embryos at embryonic day 13.5 to postnatal day 1.5) and adult tissue to determine the 

10 expression of mouse TANGO 253 mRNA. Expression of mouse TANGO 253 during 

embryogenesis was ubiquitously expressed throughout the central nervous system. Strong 
expression of mouse TANGO 253 was detected in choriod plexus of the fourth ventricle 
of E18.5 and El. 5 embryos examined. Expression of mouse TANGO 253 was also 
detected in the lungs of E14.5 and El 5.5 embryos and in the kidneys of E15.5 embryos. 

1 5 Mouse TANGO 253 expression was detected by in situ hybridization in the 

following adult tissues: a signal was detected in the brain in the choroid plexus of the 
lateral and 4th ventricles, and the olfactory bulb; a signal was detected in the cortical 
region of the kidney consistent with the pattern of glomeruli (in particular, the cortical 
radial veins); a ubiquitous signal was detected in the thymus; a weak, ubiquitous signal 

20 was detected in the spleen; a moderate signal was associated with the seminiferous 
vesicles of the testes; a signal was detected in the ovaries; and a ubiquitous signal 
restricted to the zone of giant cells was detected in the placenta. 

Clone EpTm253, which encodes mouse TANGO 253, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 

25 2209) on April 21 , 1999 and assigned Accession Number 207215. This deposit will be 
maintained under the terms of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C §112. 

30 

Uses of TANGO 253 Nucleic acids. Polypeptides, and Modulators Thereof 

As TANGO 253 was originally found in the coronary artery smooth muscle library 
described above, TANGO 253 nucleic acids, proteins, and modulators thereof can be used 
to modulate the proliferation, development, differentiation, and/or function of organs, e.g., 
35 tissues and cells that form blood vessels and coronary tissue, e.g., cells of the coronary 
connective tissue, e.g., abnormal coronary smooth muscle cells and/or endothelial cells of 
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blood vessels. TANGO 253 nucleic acids, proteins, and modulators thereof can also be 
used to modulate symptoms associated with abnormal coronary function, e.g., heart 
diseases and disorders such as atherosclerosis, coronary artery disease and plaque 
formation. 

5 In light of the collagen domain, TANGO 253 nucleic acids, proteins and 

modulators thereof can be utilized to modulate (e.g., stabilize, promote, inhibit or disrupt) 
cell/extracellular matrix (ECM) interactions, cell/cell interactions and, for example, signal 
transduction events associated with such interactions. For example, such TANGO 253 
compositions and modulators thereof can be used to modulate binding of such ECM- 

10 associated factors as integrin and can function to modulate ligand binding to cell surface 
receptors. In addition, TANGO 253 nucleic acids, proteins and modulators thereof can be 
utilized to modulate connective tissue formation, maintenance and function, as well as to 
modulate symptoms associated with connective tissue-related disorders, to promote wound 
healing, and to reduce, slow or inhibit ameliorate connective tissue-related signs of aging, 

1 5 such as wrinkle formation. 

In light of the Clq domain exhibited by TANGO 253 proteins and their similarity 
to the collectin family, TANGO 253 nucleic acids, proteins and modulators thereof can be 
utilized to modulate immune-related processes such as the ability to modulate host 
immune response by, e.g., modulating one or more elements in the serum complement 

20 cascade, including, for example activation of the cascade, formation of and/or binding to 
immune complexes, detection and defense against surface antigens and bacteria, and 
immune surveillance for rapid removal or pathogens. Such TANGO 253 compositions 
and modulators thereof can be utilized, e.g., to ameliorate incidence of any symptoms 
associated with disorders that involve such immune-related processes, including, but not 

25 limited to infection and autoimmune disorders. 

In addition, such compositions and modulators thereof can be utilized to modulate 
folding and alignment of the collagen domain (e.g., into a triple helix), disorders 
associated with collagen defects, including but not limited to bone disorders, e.g., bone 
resorption disorders, or hearing, e.g., inner ear, disorders, to modulate protein-protein 

30 interactions and recognition events (either homotypic or heterotypic) and cellular response 
events (e.g., signal transduction events) associated with such interactions and recognitions, 
and to ameliorate symptoms associated with abnormal signaling, protein-protein 
interaction and/or cellular response events including, but not limited to cell proliferation 
disorders such as cancer, abnormal neuronal interactions, such as disorders involving 

35 abnormal synaptic activity, e.g., abnormal Purkinje cell activities. 
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Human TANGO 253 protein contains an RGD domain. As such, TANGO 253 
nucleic acids, proteins and modulators thereof can be utilized to modulate processes 
involved in, e.g., bone development, sepsis, tumor progression, metastasis, cell migration, 
fertilization, and cellular interactions with the extracellular matrix required for growth, 
5 differentiation, and apoptosis, as well as cellular processes involving cell adhesion, such as 
cell migration. 

TANGO 253 proteins exhibit similarity to adipocyte complement-related protein 
precursor and can act as signaling molecules for adipocyte tissue. In light of this, TANGO 
253 nucleic acids, proteins and modulators thereof can be utilized to modulate adipocyte 

10 function and adipocyte-related processes and disorders such as, e.g., obesity. 

TANGO 253 nucleic acids, proteins, and modulators thereof can also be utilized to 
modulate the development, differentiation, maturation, proliferation and/or activity of cells 
of the central nervous system such as neurons, glial cells (e.g., astrocytes and 
oligodendrocytes), and Schwann cells. TANGO 253 nucleic acids, polypeptides, or 

15 modulators thereof can also be used to treat disorders of the brain, such as cerebral edema, 
hydrocephalus, brain herniations, iatrogenic disease (due to, e.g., infection, toxins, or 
drugs), inflammations (e.g., bacterial and viral meningitis, encephalitis, and cerebral 
toxoplasmosis), cerebrovascular diseases (e.g., hypoxia, ischemia, and infarction, 
intracranial hemorrhage and vascular malformations, and hypertensive encephalopathy), 

20 tumors (e.g., neuroglial tumors, neuronal tumors, tumors of pineal cells, meningeal 

tumors, primary and secondary lymphomas, intracranial tumors, and medulloblastoma), 
and to treat injury or trauma to the brain. 

TANGO 253 nucleic acids, proteins, and modulators thereof can also be utilized to 
treat renal (kidney) disorders, such as glomerular diseases (e.g., acute and chronic 

25 glomerulonephritis, rapidly progressive glomerulonephritis, nephrotic syndrome, focal 
proliferative glomerulonephritis, glomerular lesions associated with systemic disease, such 
as systemic lupus erythematosus, Goodpasture's syndrome, multiple myeloma, diabetes, 
polycystic kidney disease, neoplasia, sickle cell disease, and chronic inflammatory 
diseases), tubular diseases (e.g., acute tubular necrosis and acute renal failure, polycystic 

30 renal diseasemedullary sponge kidney, medullary cystic disease, nephrogenic diabetes, and 
renal tubular acidosis), tubulointerstitial diseases (e.g., pyelonephritis, drug and toxin 
induced tubulointerstitial nephritis, hypercalcemic nephropathy, and hypokalemic 
nephropathy), acute and rapidly progressive renal failure, chronic renal failure, 
nephrolithiasis, gout, vascular diseases (e.g., hypertension and nephrosclerosis, 

35 microangiopathic hemolytic anemia, atheroembolic renal disease, diffuse cortical necrosis, 
and renal infarcts), or tumors (e.g., renal cell carcinoma and nephroblastoma). 
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TANGO 253 nucleic acids, proteins and modulators thereof can, in addition to the 
above, be utilized to regulate or modulate development and/or differentiation of processes 
involved in microglial, lung, liver, kidney, pancreas, brain, placental and skeletal muscle 
formation and activity, as well as in ameliorating any symptom associated with a disorder 
5 of such cell types, tissues and organs. 

TANGO 253 expression can be utilized as a marker {e.g., an in situ marker) for 
specific tissues {e.g., the brain) and/or cells {e.g., neurons) in which TANGO 253 is 
expressed. TANGO 253 nucleic acids can also be utilized for chromosomal mapping. 

10 Human TANGO 257 

A cDNA encoding human TANGO 257 was identified by analyzing the sequences 
of clones present in a coronary smooth muscle library for sequences that encode secreted 
proteins. This analysis led to the identification of a clone, Athma7cl0, encoding full- 
length human TANGO 257. The human TANGO 257 cDNA of this clone is 1832 

1 5 nucleotides long (Figures 9A-9B; SEQ ID NO: 1 5). The open reading frame of this cDNA, 
nucleotides 88 to 1305 of SEQ ID NO: 15 (SEQ ID NO: 16), encodes a 406 amino acid 
secreted protein (Figures 9A-9B; SEQ ID NO: 17). 

Figure 1 0 depicts a hydropathy plot of human TANGO 257. Relatively 
hydrophobic regions of the protein are above the horizontal line, and relatively hydrophilic 

20 regions of the protein are below the horizontal line. The cysteine residues (cys) and N- 
glycosylation sites are (Ngly) are indicated by short vertical lines just below the 
hydropathy trace. The dashed vertical line separates the signal sequence from the mature 
protein described below. 

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 

25 Engineering 10: 1 -6) predicted that human TANGO 257 includes a 2 1 amino acid signal 
peptide (amino acid 1 to amino acid 21 of SEQ ID NO: 17; SEQ ID NO: 19) preceding the 
mature human TANGO 257 protein (corresponding to amino acid 22 to amino acid 406 of 
SEQ ID NO:17; SEQ ID NO:18). The molecular weight of human TANGO 257 protein 
without post-translational modifications is 46.0 kDa prior to the cleavage of the signal 

30 peptide, 43.8 kDa after cleavage of the signal peptide. 

Two N-glycosylation sites are present in human TANGO 257. The first has the 
sequence NDTA and is found at amino acids 177 to 180 of SEQ ID NO: 17, and the second 
has the sequence NRTV and is found at amino acids 248 to 251 of SEQ ID NO: 17. A 
cAMP and cGMP dependent protein kinase phosphorylation site having the sequence 

35 RJCAS is found in human TANGO 257 at amino acids 196 to 199 of SEQ ID NO: 17. Five 
protein kinase C phosphorylation sites are present in human TANGO 257. The first has 
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the sequence SSR (at amino acids 48 to 50 of SEQ ID NO: 17), the second has the 
sequence SGR (at amino acids 84 to 86 of SEQ ED NO: 17), the third has the sequence 
SMK (at amino acids 144 to 146 of SEQ ID NO: 17), the fourth has the sequence TEK (at 
amino acids 166 to 168 of SEQ ID NO: 17) and the fifth has the sequence SLR (at amino 

5 acids 374 to 376 of SEQ ID NO: 17). Five casein kinase II phosphorylation sites are 

present in human TANGO 257. The first has the sequence TEAD (at amino acids 78 to 8 1 
of SEQ ID NO: 17), the second has the sequence TQND (at amino acids 175 to 1 78 of 
SEQ ID NO: 17), the third has the sequence TVVD (at amino acids 250 to 253 of SEQ ID 
NO: 1 7), the fourth has the sequence TYID (at amino acids 272 to 275 of SEQ ED NO: 1 7), 

1 0 and the fifth has the sequence TRED (at amino acids 289 to 292 of SEQ ED NO: 1 7). 
Human TANGO 257 has a tyrosine kinase phosphorylation site having the sequence 
RLEREVDY at amino acids 89 to 96 of SEQ ID NO: 17). Human TANGO 257 has three 
N-myristylation sites. The first has the sequence GGPGTK (at amino acids 1 15 to 120 of 
SEQ ID NO: 17), the second has the sequence GGPAGL (at amino acids 152 to 157 of 

15 SEQ ID NO: 17) and the third has the sequence GAHASL (at amino acids 370 to 375 of 
SEQ ID NO: 17). Human TANGO 257 has an amidation site having the sequence KGRR 
at amino acids 122 to 125 of SEQ ID NO:17. 

Northern analysis of human TANGO 257 expression demonstrates moderate 
expression in heart, liver and pancreas, and low expression in kidney, lung and skeletal 

20 muscle. 

Secretion assays reveal a human TANGO 257 protein of approximately 50kDa. 
The secretion assays were performed as described in the human TANGO 253 section 
above. 

The human TANGO 257 nucleotide sequence was mapped to human chromosome 
25 1 using the Genebridge 4 Human Radiation hybrid mapping panel with GGATGATGG 
CTACCAGATTGTC (SEQ ID NO: 195) as the forward primer and 
GGAACATTGAGGGTTTTGACTC (SEQ ID NO: 196) as the reverse primer. 

TANGO 257 is homologous to a protein encoded by a nucleic acid sequence 
referred to in PCT Publication WO 98/39446 as "gene 64". Figure 14 shows an alignment 
30 of the human TANGO 257 amino acid sequence (SEQ ID NO: 1 7) with the gene 64 
encoded amino acid sequence (SEQ ID N0.43). As shown in the figure, the 353 amino 
acid gene 64 polypeptide is identical to amino acid residues 1-353 of human TANGO 257 
(SEQ ID NO: 17). Human TANGO 257 contains 406 amino acids, i.e., contains an 
additional 53 amino acid residues carboxy to residue 353. The overall amino acid 
35 sequence identity between full-length human TANGO 257 polypeptide and the gene 64- 
encoded polypeptide is approximately 87%. 
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Figures 15A-15D show an alignment of the nucleotide sequence of gene 64 (SEQ 
ID NO;66; PCT Publication WO 98/39446) and the nucleotide sequence of human 
TANGO 257 (SEQ ID NO: 15). The nucleotide sequences of gene 64 and human TANGO 
257 are 93.5% identical. Among the differences between the sequences is a cytosine 

5 nucleotide at human TANGO 257 (SEQ ID NO: 1 5) position 1 587 that represents an 
insertion relative to the corresponding gene 64 position when the gene 64 and TANGO 
. 257 sequences are aligned. This additional cytosine results in the TANGO 257 open 
reading frame being 1218 base pairs encoding a polypeptide of 406 amino acid residues. 
In contrast, the gene 64 nucleic acid sequence encodes a polypeptide of only 353 amino 

1 0 acid residues, as discussed above. 

Clone EpT257, which encodes human TANGO 257, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 
2209) on April 21, 1999 and assigned Accession Number 207222. This deposit will be 
maintained under the terms of the Budapest Treaty on the International Recognition of the 

1 5 Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

Mouse TANGO 257 

20 A cDNA encoding mouse TANGO 257 was identified by analyzing the sequences 

of clones present in a mouse microglia library using a rat TANGO 257 probe. This 
analysis led to the identification of a clone, Atmual02gbl, encoding full-length mouse 
TANGO 257. The mouse TANGO 257 cDNA of this clone is 1721 nucleotides long 
(Figures 1 1A-1 IB; SEQ ID NO:21). The open reading frame of this cDNA, nucleotides 

25 31 to 1248 of SEQ ID NO:21 (SEQ ID NO:22), encodes a 406 amino acid secreted protein 
(Figures 11A-I IB; SEQ ID NO:23). 

Figure 12 depicts a hydropathy plot of mouse TANGO 257. Relatively 
hydrophobic regions of the protein are above the horizontal line, relatively hydrophilic 
regions of the protein are below the horizontal line. The cysteine residues (cys) and N- 

30 glycosylation sites are (Ngly) are indicated by short vertical lines just below the 

hydropathy trace. The dashed vertical line separates the signal sequence from the mature 
protein described below. 

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 
Engineering 10:1-6) predicted that mouse TANGO 257 includes a 21 amino acid signal 

35 peptide (amino acid 1 to amino acid 21 of SEQ ID NO:23; SEQ ID NO:25) preceding the 
mature TANGO 257 protein (corresponding to amino acid 22 to amino acid 406 of SEQ 
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ID NO:23; (SEQ ID NO:24). The molecular weight of mouse TANGO 257 protein 
without post-translational modifications is 45.8 kDa prior to the cleavage of the signal 
peptide, 43.6 kDa after cleavage of the signal peptide. 

Two N-glycosylation sites are present in mouse TANGO 257. The first has the 

5 sequence NDTA and is found at amino acids 177 to 180 of SEQ ID NO:23, and the second 
has the sequence NRTV and is found at amino acids 248 to 25 1 of SEQ ID NO:23. A 
cAMP and cGMP-dependent protein kinase phosphorylation site having the sequence 
RKAS is found in mouse TANGO 257 at amino acids 196 to 199 of SEQ ID NO:23. Five 
protein kinase C phosphorylation sites are present in mouse TANGO 257. The first has 

10 the sequence SSR (at amino acids 48 to 50 of SEQ ID NO:23), the second has the 
sequence TLR (at amino acids 75 to 77 of SEQ ID NO:23), the third has the sequence 
SGR (at amino acids 84 to 86 of SEQ ID NO:23), the fourth has the sequence SMK (at 
amino acids 144 to 146 of SEQ ID NO:23) and the fifth has the sequence SLR (at amino 
acids 374 to 376 of SEQ ED NO:23). Five casein kinase II phosphorylation sites are 

15 present in mouse TANGO 257. The first has the sequence TEAD (at amino acids 78 to 81 
of SEQ ED NO:23), the second has the sequence TQND (at amino acids 175 to 178 of 
SEQ ID NO:23), the third has the sequence TVVD (at amino acids 250 to 253 of SEQ ID 
NO:23), the fourth has the sequence TYED (at amino acids 272 to 275 of SEQ ID NO:23), 
and the fifth has the sequence TRRD (at amino acids 289 to 292 of SEQ ID NO:23). 

20 Mouse TANGO 257 has a tyrosine kinase phosphorylation site having the sequence 

RLEREVDY at amino acids 89 to 96 of SEQ ID NO:23. Mouse TANGO 257 has four N- 
myristylation sites. The first has the sequence GGPGAK (at amino acids 1 15 to 120 of 
SEQ ID NO:23), the second has the sequence GGSVGL (at amino acids 151 to 157 of 
SEQ ID NO:23), the third has the sequence GGPGGG (at amino acids 227 to 232 of SEQ 

25 ID NO:23), and the fourth has the sequence GAHASL (at amino acids 370 to 375 of SEQ 
ID NO:23). Mouse TANGO 257 has an amidation site having the sequence KGRR at 
amino acids 122 to 125 of SEQ ID NO:23. 

As shown in Figure 13, human TANGO 257 protein and mouse TANGO 257 
protein are 94.1% identical. 

30 Figure 16 shows an alignment of mouse TANGO 257 amino acid sequence (SEQ 

ID NO:23) with the amino acid sequence encoded by gene 64 (SEQ ED NO:43). As 
shown in the figure, the 253 amino acid gene 64 polypeptide and the 406 amino acid 
mouse TANGO 257 polypeptide and the 406 amino acid mouse TANGO 257 polypeptide 
are approximately 82% identical. Figures 17A-17C show an alignment of the nucleotide 

35 sequence of gene 64 (SEQ ID NO:66; PCT publication no. 98/39446) and the nucleotide 
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sequence of mouse TANGO 257 (SEQ ID NO:21). As shown in the figure, the two 

nucleotide sequences are approximately 76% identical. 

In situ tissue screening was performed on mouse adult tissues and embryonic 

tissues (obtained from embryos E13.5 to PI. 5) to analyze for the expression of mouse 
5 TANGO 257 mRNA. Mouse TANGO 257 expression was detected the following adult 

tissues: the submandibular gland; the renal papilla region of the kidney; the capsule region 

of the adrenal gland; and the labyrinth zone of the placenta. 

In the case of embryonic expression, mouse TANGO 257 expression was detected 

in the bones, lungs, intestines, and kidneys. At E13.5, a signal was detected in many 
10 tissues including the developing bone structures such as the vertebrae, of the spinal 

column, jaw, and scapula. At E14.5, the signal pattern was very similar to that detected at 

E13.5. At 15.5, a signal was detected in all major bone structures, including the skull, 

basisphenoid bone, upper and lower incisor teeth, vertebral column, sternum, scapula, and 

femur. A ubiquitous signal was also detected in the lung, kidney, and intestinal tract. At 
15 16.5 and 18.5, the signal is very similar to that detected at E15.5. At PI .5, a signal was 

still detected in all of the major bone structures and signal detected in the lung, kidney, 

and intestines has dropped to nearly background levels. 

Clone EpTm257, which encodes mouse TANGO 257, was deposited with the 

American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 
20 2209) on April 21 , 1999 and assigned Accession Number 2071 17. This deposit will be 

maintained under the terms of the Budapest Treaty on the International Recognition of the 

Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 

merely as a convenience for those of skill in the art and is not an admission that a deposit 

is required under 35 U.S.C. §112. 

25 

Uses of TANGO 257 Nucleic acids. Polypeptides, and Modulators Thereof 

As TANGO 257 was originally found in a coronary artery smooth muscle library, 
TANGO 257 nucleic acids, proteins, and modulators thereof can be used to modulate the 
proliferation, development, differentiation, and/or function of organs, e.g., heart, tissues 

30 and cells that form blood vessels and coronary tissue, e.g., cells of the coronary 

connective tissue, e.g., coronary smooth muscle cells and/or endothelial cells of blood 
vessels. TANGO 257 nucleic acids, proteins, and modulators thereof can also be used to 
modulate symptoms associated with abnormal coronary function, e.g., heart diseases and 
disorders such as atherosclerosis, coronary artery disease and plaque formation. 

35 In light of TANGO 257's homology to the extracellular molecule olfactomedin, 

TANGO 257 nucleic acids, proteins and modulators thereof can be utilized to modulate 



-44- 



WO 00/78808 



PCT/US00/16883 



development, differentiation, proliferation and/or activity of neuronal cells, e.g., olfactory 
neurons and to modulate neuronal activities involving maintenance, growth and/or 
differentiation of chemosensory cilia, modulate cell-cell interactions and cell-ECM 
interactions, e.g., neuronal (such as olfactory) cell-ECM interactions. TANGO 257 

5 nucleic acids, proteins and modulations thereof can also be used to modulate symptoms 
associated with abnormal processes involving such cells and/or activities, for example 
neuronal function, e.g., neurological disorders, neurodegenerative disorders, 
neuromuscular disorders, cognitive disorders, personality disorders, and motor disorders, 
and chemosensory disorders, such as olfactory-related disorders. 

10 TANGO 257 exhibits homology to a gene referred to as "gene 64" (PCT 

Publication No. WO 98/39446), which is expressed primarily in fetal lung tissue. In light 
of this, TANGO 257 nucleic acids, proteins and modulators thereof can also be used to 
modulate development, differentiation, proliferation and/or activity of pulmonary system 
cells, e.g., lung cell types, and to modulate a symptom associated with disorders of 

15 pulmonary development, differentiation and/or activity, e.g., cystic fibrosis. TANGO 257 
nucleic acids, proteins and modulators thereof can also be used to modulate symptoms 
associated with abnormal pulmonary development or function, such as lung diseases or 
disorders associated with abnormal pulmonary development or function, e.g., cystic 
fibrosis. TANGO 257 nucleic acids, polypeptides, or modulators thereof can be used to 

20 treat pulmonary (lung) disorders, such as atelectasis, cystic fibrosis, rheumatoid lung 
disease, pulmonary congestion or edema, chronic obstructive airway disease (e.g., 
emphysema, chronic bronchitis, bronchial asthma, and bronchiectasis), diffuse interstitial 
diseases (e.g., sarcoidosis, pneumoconiosis, hypersensitivity pneumonitis, bronchiolitis, 
Goodpasture's syndrome, idiopathic pulmonary fibrosis, idiopathic pulmonary 

25 hemosiderosis, pulmonary alveolar proteinosis, desquamative interstitial pneumonitis, 
chronic interstitial pneumonia, fibrosing alveolitis, hamman-rich syndrome, pulmonary 
eosinophilia, diffuse interstitial fibrosis, Wegener's granulomatosis, lymphomatoid 
granulomatosis, and lipid pneumonia), or tumors (e.g., bronchogenic carcinoma, 
bronchiolovlveolar carcinoma, bronchial carcinoid, hamartoma, and mesenchymal 

30 tumors). 

TANGO 257 nucleic acids, proteins and modulators thereof can also be used to 
modulate cell proliferation, e.g., abnormal cell proliferation. Such modulation may, for 
example, be via modulation of one or more elements involved in signal transduction 
cascades. 

35 TANGO 257 nucleic acids, proteins and modulators thereof can also be utilized to 

modulate the development, differentiation, maturation, proliferation and/or activity of 
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bone cells such as osteocytes, and to treat bone associated diseases or disorders. Examples 
of bone diseases and disorders include bone injury due to for example, trauma (e.g., bone 
breakage, cartilage tearing), degeneration (e.g., osteoporosis), degeneration of joints, e.g., 
arthritis, e.g., osteoarthritis, and bone wearing. Further, TANGO 257 nucleic acids, 

5 proteins and modulators thereof can be utilized to modulate or regulate the development of 
bone structures such as the skull, the basisphenoid bone, the upper and lower incisor teeth, 
the vertebral column, the sternum, the scapula, and the femur during embryogenesis. 

TANGO 257 nucleic acids, proteins and modulators thereof can, in addition to the 
above, be utilized to regulate or modulate development and/or differentiation of processes 

10 involved in microglial, liver, kidney, and skeletal muscle formation and activity, as well 
as in ameliorating a symptom associated with a disorder of such cell types, tissues and 
organs. 

TANGO 257 nucleic acids, polypeptides, or modulators thereof can also be used to 
treat renal (kidney) disorders, such as glomerular diseases (e.g., acute and chronic 

15 glomerulonephritis, rapidly progressive glomerulonephritis, nephrotic syndrome, focal 
proliferative glomerulonephritis, glomerular lesions associated with systemic disease, such 
as systemic lupus erythematosus, Goodpasture's syndrome, multiple myeloma, diabetes, 
polycystic kidney disease, neoplasia, sickle cell disease, and chronic inflammatory 
diseases), tubular diseases (e.g., acute tubular necrosis and acute renal failure, polycystic 

20 renal diseasemedullary sponge kidney, medullary cystic disease, nephrogenic diabetes, and 
renal tubular acidosis), tubulointerstitial diseases (e.g., pyelonephritis, drug and toxin 
induced tubulointerstitial nephritis, hypercalcemic nephropathy, and hypokalemic 
nephropathy), acute and rapidly progressive renal failure, chronic renal failure, 
nephrolithiasis, gout, vascular diseases (e.g., hypertension and nephrosclerosis, 

25 microangiopathic hemolytic anemia, atheroembolic renal disease, diffuse cortical necrosis, 
and renal infarcts), or tumors (e.g., renal cell carcinoma and nephroblastoma). 
TANGO 257 polypeptides, nucleic acids, or modulators thereof can be used to treat 
intestinal disorders, such as ischemic bowel disease, infective enterocolitis, Crohn's 
disease, benign tumors, malignant tumors (e.g., argentaffinomas, lymphomas, 

30 adenocarcinomas, and sarcomas), malabsorption syndromes (e.g., celiac disease, tropical 
sprue, Whipple's disease, and abetalipoproteinemia), obstructive lesions, hernias, 
intestinal adhesions, intussusception, or volvulus. 

Further, TANGO 257 expression can be utilized as a marker (e.g. an in situ 
marker) for specific tissues (i.e., bone structures) and/or cells (i.e., osteocytes) in which 

35 TANGO 257 is expressed. TANGO 257 nucleic acids can also be used for chromosomal 
mapping. 
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Human INTERCEPT 258 

A cDNA encoding human INTERCEPT 258 was identified by analyzing the 
sequences of clones present in a human mixed lymphocyte reaction library for sequences 
that encode secreted proteins. This analysis led to the identification of a clone, Athlxtce, 

5 encoding full-length human INTERCEPT 258. The human INTERCEPT 258 cDNA of 
this clone is 1869 nucleotides long (Figures 18A-18B; SEQ ID NO:26). The open reading 
frame of this cDNA, nucleotides 153 to 1262 of SEQ ID NO:26 (SEQ ID NO:27), encodes 
a 370 amino acid transmembrane protein (Figures 18A-18B; SEQ ID NO:28). 

Figure 19 depicts a hydropathy plot of human INTERCEPT 258. Relatively 

10 hydrophobic regions of the protein are shown above the horizontal line, and relatively 
hydrophilic regions of the protein are below the horizontal line. The cysteine residues 
(cys) are indicated by short vertical lines just below the hydropathy trace. The dashed 
vertical line separates the signal sequence (amino acids 1 to 29 of SEQ ID NO:28; SEQ ID 
NO:30) on the left from the mature protein (amino acids 30 to 370 of SEQ ID NO:28; 

15 SEQ ID NO:29) on the right. 

The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 
Engineering 10:1-6) predicted that human INTERCEPT 258 includes a 29 amino acid 
signal peptide (amino acid 1 to amino acid 29 of SEQ ID NO:26; SEQ ID NO:30) 
preceding the mature INTERCEPT 258 protein (corresponding to amino acid 30 to amino 

20 acid 370 of SEQ ID NO:26; SEQ ID NO:29). The molecular weight of human 

INTERCEPT 258 protein without post-translational modifications is 40.0 kDa prior to the 
cleavage of the signal peptide, 37.0 kDa after cleavage of the signal peptide. 

Human INTERCEPT 258 contains a hydrophobic transmembrane domain at amino 
acids amino acids 207 to 224 of SEQ ID NO:28 (SEQ ID NO:78) and amino acids 247 to 

25 271 of SEQ ID NO:28 (SEQ ID NO:33). Human INTERCEPT 258 also contains two Ig 
domains, one at amino acids 49 to 128 of SEQ ID NO:28 (SEQ ID NO:35) and a second at 
amino acids 167 to 226 of SEQ ID NO:28 (SEQ ID NO:36). 

Five N-glycosylation sites are present in human INTERCEPT 258. The first has 
sequence NLSL and is found at amino acids 108 to 1 1 1 of SEQ ID NO:28, the second has 

30 the sequence NUTL and is found at amino acids 169 to 172 of SEQ ID NO:28; the third is 
has the sequence NLSS and is found at amino acids 213 to 216 of SEQ ID NO:28, the 
fourth has the sequence NUTL and is found at amino acids, 236 to 239 of SEQ ID NO:28, 
and the fifth has the sequence NGTL and is found at amino acids 307 to 3 10 of SEQ ID 
NO:28. Seven protein kinase C phosphorylation sites are present in human INTERCEPT 

35 258. The first has the sequence TSK and is found at amino acids 93 to 95 of SEQ ID 
NO:28, the second has the sequence SLR and is found at amino acids 1 10 to 1 12 of SEQ 
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ID NO:28, the third has the sequences SIK and is found at amino acids 141 to 143 or SEQ 
ID NO:28, the fourth has the sequence SCR and is found at amino acids 157 to 159, the 
fifth has the sequence SPR and is found at amino acids 176 to 179 of SEQ ID NO:28, the 
sixth has the sequence SAR and is found at amino acids 315 to 317 of SEQ ID NO:28, and 

5 the seventh has the sequence SPR and is found at amino acids 344 to 346 of SEQ ID 

NO:28. The human INTERCEPT 258 protein has seven N-myristoylation sites. The first 
has the sequence GUTTSK and is found at amino acids 90 to 95 of SEQ ID NO:28, the 
second has the sequence GANVTL and is found at amino acids 167 to 172 of SEQ ID 
NO:28, the third has the sequence GVYVCK and is found at amino acids 220 to 225, the 

10 fourth has the sequence GTAQCN and is found at amino acids 23 1 to 236 of SEQ ID 
NO:28, the fifth has the sequence GTLVGL and is found at amino acids 256 to 261, the 
sixth has the sequence GLLAGL and is found at amino acids 262 to 267 of SEQ ID 
NO:28, and the seventh has the sequence GTLSSU and is found at acids 308 to 313 of 
SEQIDNO:28. 

15 The human INTERCEPT 258 gene was mapped to human chromosome 1 1 using 

Genebridge 4 Human Radiation hybrid mapping panel with 
GGAGTATCCTTGGTCTACTCC (SEQ ED NO: 197) as the forward primer and 
GAAAGTCTGGAAGGATGGAAGCT (SEQ ID NO: 198) as the reverse primer. 

Human multi-tissue dot blot analysis of human INTERCEPT 258 expression 

20 demonstrates strongest expression in lung, fetal lung, placenta, thyroid gland and 

mammary gland. Moderate expression is observed in heart, aorta, kidney, small intestine, 
fetal heart, fetal kidney, fetal spleen, uterus, and stomach. Weak expression is observed in 
whole brain, amygdala, caudate nucleus, cerebellum, cerebral cortex frontal lobe, 
hippocampus, medulla oblongata, occipital lobe, putamen, substantia nigra, temporal lobe, 

25 thalamus, acumens, spinal cord, skeletal muscle, colon, bladder, prostate, ovary, pancreas, 
pituitary gland, adrenal gland, salivary gland, liver, spleen, thymus, lymph node, bone 
marrow, appendix, trachea, fetal brain, fetal liver, and fetal thymus. 

A human cancer cell line Northern blot analysis showed a roughly 2.0 kb 
INTERCEPT 258 band only in the lane containing cell line Chronic Myelogenous 

30 Leukemia (K-562). The cancerous cell lines in which INTERCEPT 258 was not 
expressed include promyeocytic leukemia, Hela, lymphoblastic leukemia, Burkitt's 
lymphoma Raji, colorectal adenocarcinoma, lung carcinoma and melanoma. 

INTERCEPT 258 exhibits homology to a human A33 antigen. A33 antigen is a 
transmembrane glycoprotein and a member of the immunoglobulin superfamily that may 

35 represent a cancer cell marker (Heath et al., 1997, Proc. Natl. Acad. Sci. USA 94:469- 

474). Figure 23 shows an alignment of the human INTERCEPT 258 amino acid sequence 
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(SEQ ID NO:28) with the human A33 amino acid sequence (SEQ ID NO:67). The 
alignment shows that there is a 23.0% overall amino acid sequence identity between 
human INTERCEPT 258 and A33. Figures 24A-24D show an alignment of the human 
INTERCEPT 258 nucleotide sequence (SEQ ID NO:26) with that of human A33 

5 nucleotide sequence (SEQ ID NO:68). The alignment shows that there is a 40.6% identity 
between the two sequences. 

Human INTERCEPT 258 nucleotide sequence (SEQ ID NO:26) exhibits 
homology to human PECAM-1 nucleotide sequence (SEQ ID NO:72). Figures 27A-27E 
show that there is an overall 40.5% identity between the two nucleotide sequences. 

1 0 Human INTERCEPT 258 amino acid sequence (SEQ ID NO:28) and human PECAM-1 
amino acid sequence (SEQ ID NO:73) share less than 18% identity. PECAM-1 (platelet 
endothelial cell adhesion molecule-1) is an integrin expressed on endothelial cells. 

Clone EpT258, which encodes human INTERCEPT 258, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 

15 2209) on April 21, 1999 and assigned Accession Number 207222. This deposit will be 
maintained under the terms of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

20 

Mouse INTERCEPT 258 

A cDNA encoding mouse INTERCEPT 258 was identified by analyzing the 
sequences of clones present in a mouse megakaryocyte library for sequences that encode 
secreted proteins. This analysis led to the identification of a clone, Athmeal7c8, encoding 

25 full-length mouse INTERCEPT 258. The mouse INTERCEPT 258 cDNA of this clone is 
1 846 nucleotides long (Figures 20A-20B; SEQ ED NO:37). The open reading frame of 
this cDNA, nucleotides 107 to 1288 of SEQ ID NO:37 (SEQ ED NO:38), encodes a 394 
amino acid transmembrane protein (Figures 20A-20B, SEQ ID NO:39). 

Figure 21 depicts a hydropathy plot for mouse INTERCEPT 258. Relatively 

30 hydrophobic regions of the protein are above the horizontal line, relatively hydrophilic 
regions of the protein are below the horizontal line. The cysteine residues (cys) and N- 
glycosylation sites are (Ngly) are indicated by short vertical lines just below the 
hydropathy trace. The dashed vertical line separates the signal sequence from the mature 
protein described below. 

35 The signal peptide prediction program SIGNALP (Nielsen et al., 1997, Protein 

Engineering 10:1-6) predicted that mouse INTERCEPT 258 includes a 29 amino acid 
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signal peptide (amino acid 1 to amino acid 29 of SEQ ID NO:39; SEQ ID NO:41) 
preceding the mature INTERCEPT 258 protein (corresponding to amino acid 30 to amino 
acid 394 of SEQ ID NO:39; SEQ ID NO:40). The molecular weight INTERCEPT 258 
without post-translational modifications is 41.8 kDa prior to the cleavage of the signal 

5 peptide, 38.90 kDa after cleavage of the signal peptide. 

Mouse INTERCEPT 258 contains a hydrophobic transmembrane domain at amino 
acids 250 to 274 SEQ ID NO:39 (SEQ ID NO:44). Mouse INTERCEPT 258 also contains 
an Ig domain at amino acids 170 to 229 of SEQ ID NO:39 (SEQ ID NO:71). 

Five N-glycosylation sites are present in mouse INTERCEPT 258. The first has 

10 sequence NVSL and is found at amino acids 1 1 1 to 1 14 of SEQ ID NO:39, the second has 
the sequence NVTL and is found at amino acids 172 to 175 of SEQ ID NO:39, the third 
has the sequence NLSI and is found at amino acids 216 to 219 of SEQ ED NO:39, the 
fourth has the sequence NVTL and is found at amino acids, 239 to 242 of SEQ ID NO:39, 
and the fifth has the sequence NGTL and is found at amino acids 310 to 313 of SEQ ID 

15 NO:39. Nine protein kinase C phosphorylation sites are present in mouse INTERCEPT 
258. the first has the sequence TNK and is found at amino acids 96 to 98 of SEQ ID 
NO:39, the second has the sequence SSR and is found at amino acids 108 to 1 10 of SEQ 
ID NO:39, the third has the sequence SLR and is found at amino acids 1 13 to 1 15 of SEQ 
ID NO:39, the fourth has the sequence TYR and is found at amino acids 126 to 128, the 

20 fifth has the sequence SIK and is found at amino acids 144 to 146 of SEQ ID NO:39, the 
sixth has the sequence SPR and is found at amino acids 179 to 181 of SEQ ID NO:39, the 
seventh has the sequence SLK and is found at amino acids 211 and 213, the eighth has the 
sequence SAR and is found at amino acids 318 to 320 of SEQ ID NO:39, and the ninth 
has the sequence SPR and is found at amino acids 348 to 350 of SEQ ID NO:39. The 

25 mouse INTERCEPT 258 contains a casein kinase II phosphorylation site having the 
sequence TLEE, found at amino acids 280 to 283 of SEQ ID NO:39. The mouse 
INTERCEPT 258 protein has nine N-myristoylation sites. The first has the sequence 
GTPETS and is found at amino acids 6 to 1 1 of SEQ ID NO:39, the second has the 
sequence GVMTNK and is found at amino acids 125 to 130 of SEQ ID NO:39, the third 

30 has the sequence GTYRCS and is found at amino acids 125 to 130, the fourth has the 
sequence GTNVTL and is found at amino acids 170 to 175 of SEQ ID NO:39, the fifth 
has the sequence GVYVCK and is found at amino acids 223 to 228, the sixth has the 
sequence GSKAAV and is found at amino acids 247 to 252, the seventh has the sequence 
GAVVGT and is found at amino acids 255 to 260 of SEQ ID NO:39, the eighth has 

35 sequence GTLSSV and is found at amino acids 31 1 to 316 of SEQ ID NO:39, and the 
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ninth has the sequence GGVSSS and is found at amino acids 367 to 372 of SEQ ID 
NO:39. 

An in situ expression analysis of INTERCEPT 258 was performed as summarized 
herein. Mouse INTERCEPT 258 expression during embryogenesis (E73.5 to PI. 5 were 

5 examined) was observed throughout the animal in a punctate pattern. This pattern is very 
similar to that seen with the molecule PECAM-1, but at a lower intensity. PECAM-1 is an 
integrin expressed on endothelial cells. In addition, lung and brown fat exhibited a much 
higher signal in a more ubiquitous pattern in all embryonic stages examined. Heart and 
kidney also have a higher expression, but to a lesser degree. Adult mouse INTERCEPT 

10 258 expression was seen in many tissues, often in a multifocal, punctate pattern suggestive 
of vessels. Expression was also predominant in many highly vascularized tissues such as 
ovary (especially the septol region), kidney and adrenal cortex. 

In general, both embryonic and adult expression patterns were suggestive of 
endothelial cells being a component in the expression patters observed. In summary, 

15 tissues in which INTERCEPT 258 expression was observed were as follows: brain, eye, 
harderian gland, submanibular gland, bladder, brown fat, stomach, heart, kidney, adrenal 
gland, colon, liver, thymus, lymph node, spleen, spinal cord, ovary, testes and placenta. 

As shown in Figure 22, human INTERCEPT 258 protein and mouse INTERCEPT 
258 protein are 62.8% identical. 

20 Mouse INTERCEPT 258 exhibits homology to a human A33 antigen. Figure 25 

shows an alignment of mouse INTERCEPT 258 amino acid sequence (SEQ ED NO:39) 
with the human A33 amino acid sequence (SEQ ID NO:96). The alignment shows that 
there is a 23% overall amino acid sequence identity between the two sequences. Figures 
26A-26D show an alignment of the mouse INTERCEPT 258 nucleotide sequence (SEQ 

25 ID NO:37) with that of the human A33 nucleotide sequence (SEQ ID NO:97). The 
alignment shows that there is a 40% identity between these two nucleotide sequences. 

Clone EpTm258, which encodes mouse INTERCEPT 258, was deposited with the 
American Type Culture Collection (10801 University Boulevard, Manassas, VA 201 10- 
2209) on April 21, 1999 and assigned Accession Number 207221. This deposit will be 

30 maintained under the terms of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made 
merely as a convenience for those of skill in the art and is not an admission that a deposit 
is required under 35 U.S.C. §112. 

35 Uses of INTERCEPT 258 Nucleic acids. Polypeptides, and Modulators Thereof 
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INTERCEPT 258 was identified as being expressed in a mixed lymphocyte library. 
In light of this, INTERCEPT 258 nucleic acids, proteins and modulators thereof can be 
utilized to modulate processes involved in lymphocyte development, differentiation and 
activity, including, but not limited to development, differentiation and activation of T 

5 cells, including T helper, T cytotoxic and non-specific T killer cell types and subtypes, and 
B cells, immune functions associated with such cells, and amelioration of one or more 
symptoms associated with abnormal function of such cell types. Such disorders can 
include, but are not limited to, autoimmune disorders, such as organ specific autoimmune 
disorders, e.g., autoimmune thyroiditis, Type I diabetes mellitus, insulin-resistant diabetes, 

10 autoimmune anemia, multiple sclerosis, and/or systemic autoimmune disorders, e.g., 
rheumatoid arthritis, lupus or sclerodoma, allergy, including allergic rhinitis and food 
allergies, asthma, psoriasis, graft rejection, transplantation rejection, graft versus host 
disease, pathogenic susceptibilities, e.g., susceptibility to certain bacterial or viral 
pathogens, wound healing and inflammatory reactions. 

1 5 INTERCEPT 258 includes one or more Ig domains. INTERCEPT 258 nucleic 

acids, proteins, and modulators thereof can, therefore, be used to modulate immune 
function, e.g., by the modulation of immunoglobulins and the formation of antibodies. 
For the same reason, INTERCEPT 258 nucleic acids, proteins, and modulators thereof can 
be used to modulate immune response, leukocyte trafficking, cancer, Type I immunologic 

20 disorders, e.g., anaphylaxis and/or rhinitis, by modulating the interaction between antigens 
and cell receptors, e.g., high affinity IgE receptors. 

INTERCEPT 258 exhibits homology to PECAM-1, a cell adhesion integrin 
molecule that has been shown to mediate cell-cell interactions, play an important role in 
bidirectional signal transduction, and may be involved in thrombotic, inflammatory and 

25 immunological disorders. As such, INTERCEPT 258 nucleic acids, proteins, and 
modulators thereof can be utilized to modulate cell/cell interactions and, for example, 
signal transduction events associated with such interactions. For example, such 
INTERCEPT 258 compositions and modulators thereof can be used to modulate binding 
of cellular factors or ECM-associated factors such as integrin and can function to modulate 

30 ligand binding to cell surface receptors. Further, such INTERCEPT 258 compositions and 
modulators thereof can be utilized to ameliorate at least one symptom associated with 
thrombotic disorders, e.g., stroke, inflammatory processes or disorders, and immune 
disorders. 

In light of INTERCEPT 258 expression, INTERCEPT 258 nucleic acids, proteins 
35 and modulators thereof can be utilized modulate development, differentiation, 

proliferation and/or activity of pulmonary system cells, e.g., lung cell types, and to 
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modulate a symptom associated with disorders of pulmonary development, differentiation 
and/or activity, such as lung diseases or disorders associated with abnormal pulmonary 
development or function, e.g., cystic fibrosis. INTERCEPT 258 nucleic acids, proteins 
and modulators thereof can also be utilized modulate development, differentiation, 

5 proliferation and/or activity of thyroid cells, megakaryocytes or mammary gland cells, 

and can further be utilized to ameliorate at least one symptom of disorders associated with, 
abnormal thyroid function, e.g., thyroiditis or Grave's disease, abnormal megakaryocyte 
differentiation or function, e.g., anemias or leukemias, hematological diseases such as 
thrombocytopenia, platelet disorders and bleeding disorders, such as hemophilia or 

1 o abnormal mammary development or function. 

INTERCEPT 258 nucleic acids, polypeptides, or modulators thereof can be used to 
treat renal (kidney) disorders, such as glomerular diseases (e.g., acute and chronic 
glomerulonephritis, rapidly progressive glomerulonephritis, nephrotic syndrome, focal 
proliferative glomerulonephritis, glomerular lesions associated with systemic disease, such 

15 as systemic lupus erythematosus, Goodpasture's syndrome, multiple myeloma, diabetes, 
polycystic kidney disease, neoplasia, sickle cell disease, and chronic inflammatory 
diseases), tubular diseases (e.g., acute tubular necrosis and acute renal failure, polycystic 
renal diseasemedullary sponge kidney, medullary cystic disease, nephrogenic diabetes, and 
renal tubular acidosis), tubulointerstitial diseases (e.g., pyelonephritis, drug and toxin 

20 induced tubulointerstitial nephritis, hypercalcemic nephropathy, and hypokalemic 
nephropathy), acute and rapidly progressive renal failure, chronic renal failure, 
nephrolithiasis, gout, vascular diseases (e.g., hypertension and nephrosclerosis, 
microangiopathic hemolytic anemia, atheroembolic renal disease, diffuse cortical necrosis, 
and renal infarcts), or tumors (e.g., renal cell carcinoma and nephroblastoma). 

25 INTERCEPT 258 nucleic acids, polypeptides, or modulators thereof can also be used to treat 
disorders of the brain, such as cerebral edema, hydrocephalus, brain herniations, iatrogenic 
disease (due to, e.g., infection, toxins, or drugs), inflammations (e.g., bacterial and viral 
meningitis, encephalitis, and cerebral toxoplasmosis), cerebrovascular diseases (e.g., hypoxia, 
ischemia, and infarction, intracranial hemorrhage and vascular malformations, and 

30 hypertensive encephalopathy), and tumors (e.g., neuroglial tumors, neuronal tumors, tumors 
of pineal cells, meningeal tumors, primary and secondary lymphomas, intracranial tumors, 
and medulloblastoma), and to treat injury or trauma to the brain. 

INTERCEPT 258 nucleic acids, proteins, and modulators thereof can still further 
be utilized to modulate development, differentiation proliferation and/or activity of cells 
involved in kidney or heart formation and function. In addition, such compositions and 
modulators thereof can be utilized to ameliorate at least one symptom of disorders 
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associated with abnormal kidney or heart formation or function, including, but not limited 
to nephritis, coronary disease, atherosclerosis and plaque formation. 

INTERCEPT 258 expression indicates that INTERCEPT 258 is involved, in 
addition to the above, in such processes as thermogenesis, adipocyte function, and 

5 vascularization. As such, INTERCEPT 258 nucleic acids, proteins, and modulators 

thereof can be utilized to modulate such processes as well as for ameliorating at least one 
symptom associated with such processes. Such disorders include, but are not limited to 
obesity, regulation of body temperature, and disorders involving abnormal vascularization, 
e.g., vascularization of solid tumors. 

1 0 In further light of INTERCEPT 258 expression, as well as in light of its homology 

to A33 antigen, INTERCEPT 258 nucleic acids, proteins and modulators thereof can be 
utilized to modulate cell proliferation, including, for example, epithelial, e.g., 
gastrointestinal tract epithelial cell proliferation, and to ameliorate at least one symptom of 
cell proliferative disorders such as cancer, and, in particular, chronic myelogenous 

15 leukemia, colon cancers, small bowel epithelium cancers and other gastrointestinal tract 
cancers. Further, INTERCEPT 258 expression can be utilized as a marker for specific 
tissues (e.g., vascularized tissues) and/or cells (e.g., endothelial cells) in which 
INTERCEPT 258 is expressed. INTERCEPT 258 nucleic acids can also be utilized for 
chromosomal mapping. 

20 

Human TANGO 281 

A cDNA encoding human TANGO 28 1 was identified by analyzing the sequences of 
clones present in a human megakarocyte cDNA library. This analysis led to the identification 
of a clone, AThPb8 1 d 1 0, encoding full-length human TANGO 28 1 . The human TANGO 28 1 

25 cDNA of this clone is 1812 nucleotides long (Figures 28A-28B; SEQ ID NO:46). The open 
reading frame of this cDNA, nucleotides 65 to 799 of SEQ ID NO:46 (SEQ ID NO:47), 
encodes a 245 amino acid transmembrane protein (Figures 28A-28B; SEQ ID NO:48). 

The signal peptide prediction program SIGNALP (Nielsen, et al. (1997) Protein 
Engineering 10:1-6) predicted that human TANGO 281 includes an 38 amino acid signal 

30 peptide (amino acid 1 to amino acid 38 of SEQ ED NO:48; SEQ ID NO:49) preceding the 
mature TANGO 281 protein (corresponding to amino acid 39 to amino acid 245 of SEQ ID 
NO:48; SEQ ID NO:50). The molecular weight of TANGO 281 without post-translational 
modifications is 26.5 kDa prior to the cleavage of the signal peptide, 20.2 kDa after cleavage 
of the signal peptide. 

35 Human TANGO 281 is a transmembrane protein which contains one or more of the 

following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a 
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cytoplasmic domain. The human TANGO 281 protein contains an extracellular domain at 
amino acids 1 to 123 of SEQ ED NO:48 or a mature extracellular domain at about amino acid 
residues 39 to 123 of SEQ ID NO:48 (SEQ ID NO:51), a transmembrane domain at amino 
acid residues 124 to 148 of SEQ ED NO:48 (SEQ ID NO:52), and a cytoplasmic domain at 

5 amino acid residues 149 to 245 of SEQ ED NO:48 (SEQ ED NO:53). 

Figure 29 depicts a hydropathy plot of human TANGO 281 . Relatively hydrophobic 
regions of the protein are shown above the horizontal line, and relatively hydrophilic regions 
of the protein are below the horizontal line. The cysteine residues (cys) and potential 
N-glycosylation sites (Ngly) are indicated by short vertical lines just below the hydropathy 

1 0 trace. The dashed vertical line separates the signal sequence (amino acids 1 to 38 of SEQ ID 
NO:48; SEQ ED NO:49) on the left from the mature protein (amino acids 38 to 245 of SEQ 
ID NO:48; SEQ ID NO:50) on the right. 

Human TANGO 281 comprises photosystem II 10 kD phosphoprotein (PSBH) 
domain sequences, which have been shown to be phosphorylated in a light-dependent 

15 reaction, from amino acids 41 to 90 and 127 to 182 of SEQ ID NO:48 (SEQ ID NO:54 and 
SEQ ID NO:55, respectively). Figure 30 depicts an alignment between the PSBH domain 
(SEQ ID NO:69; Accession No. PF00737) and human TANGO 281 from amino acids 97 to 
146 of SEQ ID NO:48. An N-glycosylation site having the sequence NTTT is present in 
TANGO 281 at about amino acids 160 to 163 of SEQ ID NO:48. Two protein kinase C 

20 phosphorylation sites are present in human TANGO 28 1 . The first has the sequence S VR (at 
amino acids 8 to 10 of SEQ ID NO:48), and the second has the sequence SSR (at amino acids 
87 to 89 of SEQ ID NO:48). Three casein kinase II phosphorylation sites are present in 
human TANGO 281 . The first has the sequence SEPE (at amino acids 49 to 52 of SEQ ID 
NO:48), the second has the sequence SCPD (at amino acids 53 to 56 of SEQ ID NO:48), and 

25 the third has the sequence SSLD (at amino acids 108 to 111 of SEQ ID NO:48). Human 
TANGO 281 has two N-myristylation sites. The first has the sequence GSCSSQ (at amino 
acids 60 to 65 of SEQ ID NO:48), and the second has the sequence GATVAI (at amino acids 
119 to 124 ofSEQIDNO:48). 

Nucleic acid base pairs 413 to 746 of human TANGO 281 (SEQ ID NO:46) have 

30 8 1 % identity to the nucleic acid sequence identified as Accession Number A V34245. Nucleic 
acid base pairs 438 to 746 of human TANGO 281 (SEQ ID NO:46) have 80% identity to a 
nucleic acid sequence referred to as "gene 31" described in PCT Publication No. WO 
98/39446 (SEQ ID NO:70). "Gene 3 1" is characterized as being expressed primarily in brain 
and thymus, and to a lesser extent in such organs as liver, skin, bone and bone marrow. 

35 Clone EpT281 was deposited with the American Type Culture Collection (10801 

University Boulevard, Manassas, V A 20 1 1 0-2209) on April 2 1 , 1 999 and assigned Accession 
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Number 207222. This deposit will be maintained under the terms of the Budapest Treaty on 
the International Recognition of the Deposit of Microorganisms for the Purposes of Patent 
Procedure. This deposit was made merely as a convenience for those of skill in the art and 
is not an admission that a deposit is required under 35 U.S.C § 1 12. 

5 

Mouse TANGO 281 

A cDNA encoding mouse TANGO 281 was identified in a normal mouse 
megakaryocyte library by performing expression profiling on megakarocytes obtained from 
mice with a the deletion of the element of the gata-1 gene responsible for megakaryocyte- 
10 specific expression. This analysis led to the identification of a clone, Atmea49d3, encoding 
full-length mouse TANGO 281. The mouse TANGO 281 cDNA of this clone is 1858 
nucleotides long (Figure 30; SEQ ED NO:56). The open reading frame of this cDNA, 
nucleotides 90 to 728 of SEQ ID NO:56 (SEQ ID NO:57), encodes a 213 amino acid 
transmembrane protein (Figure 30; SEQ ID NO:58). 
15 The signal peptide prediction program SIGNALP (Nielsen, et al. (1997) Protein 

Engineering 10:1-6) predicted that mouse TANGO 281 includes an 26 amino acid signal 
peptide (amino acid 1 to amino acid 26 of SEQ ID NO:58; SEQ ID NO:59) preceding the 
mature TANGO 281 protein (corresponding to amino acid 27 to amino acid 213 of SEQ ID 
NO:58; SEQ ID NO:60). The molecular weight of mouse TANGO 281 without post- 
20 translational modifications is 22.9 kDa prior to the cleavage of the signal peptide, 20.2 kDa 
after cleavage of the signal peptide. 

Mouse TANGO 281 is a transmembrane protein which contains one or more of the 
following domains: (1) an extracellular domain; (2) a transmembrane domain; and (3) a 
cytoplasmic domain. The mouse TANGO 281 protein contains an extracellular domain at 
25 amino acid residues 27 to 1 12 of SEQ ID NO:58 (SEQ ID NO:61), a transmembrane domain 
at amino acid residues 1 13 to 137 of SEQ ID NO:58 (SEQ ID NO:62), and a cytoplasmic 
domain at amino acid residues 138 to 213 of SEQ ID NO:58 (SEQ ID NO:63). 

Figure 32 depicts a hydropathy plot of mouse TANGO 281. Relatively hydrophobic 
regions of the protein are shown above the horizontal line, and relatively hydrophilic regions 
30 of the protein are below the horizontal line. The cysteine residues (cys) and potential 
N-glycosylation sites (Ngly) are indicated by short vertical lines just below the hydropathy 
trace. The dashed vertical line separates the signal sequence (amino acids 1 to 26 of SEQ ID 
NO:58; SEQ ID NO:59) on the left from the mature protein (amino acids 27 to 213 of SEQ 
ID NO:58; SEQ ID NO:60) on the right. 
3 5 Mouse TANGO 28 1 comprises photosystem II 1 0 kD phosphoprotein (PSBH) domain 

sequences, which have been shown to be phosphorylated in a light-dependent reaction, from 
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amino acids 42 to 91 and 128 to 183 of SEQ ID NO: 58 (SEQ ED NO:64 and SEQ ID NO:65, 
respectively). Two N-glycosylation sites having the sequences NTTT (at amino acids 149 
to 152 of SEQ ID NO:58) and NASS (at about amino 189 to 192 of SEQ ID NO:58) are 
present in TANGO 281 . A glycosaminoglycan attachment site having the sequence SGFG 

5 is present in mouse TANGO 281, and protein kinase C phosphorylation site having the 
sequence SSR is present in mouse TANGO 281 . Two casein kinase II phosphorylation sites 
are present in human TANGO 281. The first has the sequence TPAE (at amino acids 80 to 
83 of SEQ ID NO:58), and the second has the sequence SSFD (at amino acids 97 to 100 of 
SEQ ID NO:58). Mouse TANGO 281 has two N-myristylation sites. The first has the 

10 sequence GSCSNQ (at amino acids 48 to 53 of SEQ ID NO:58), and the second has the 
sequence GATVAI (at amino acids 1 08 to 1 1 3 of SEQ ID NO:58). 

Northern blot analysis of mouse TANGO 28 1 expression revealed two mRN A bands, 
one of approximately 1 .8 kb and another approximately 1 .4 kb. Expression of the 1 .8 kb band 
was detected in the heart, spleen, lung and kidney, with the greatest abundance detected in the 

1 5 heart and lung, followed by the kidney and trace amounts in the spleen. Expression of the 1 .4 
kb band was detected in the brain, spleen, and lung. Expression of the 1.4 kb and 1.8 kb 
species of mouse TANGO 28 1 was detected in 7 day old normal mouse embryos. Neither the 
1 .4 kb or the 1 .8 kb species of mouse TANGO 28 1 were detected in 1 1 day old normal mouse 
embryos. The 1.8 kb species of mouse TANGO 281 was detected in 15 day old normal 

20 mouse embryos at 20 % the level detected in 7 day old normal mouse embryos. Expression 
of the 1.8 kb species detected in 17 day old normal mouse embryos was comparable to the 
level of expression detected in 7 day old normal mouse embryos. Expression of mouse 
TANGO 281 expression was greatly reduced in megakaryocytes obtained from gata-1 
knockout mice. 

25 In situ tissue screening was performed on mouse adult and embryonic tissues to 

analyze for the expression of mouse TANGO 281 mRNA. Mouse TANGO 281 expression 
was detected predominantly in the adult lymphoid tissues such as the thymus, lymph node, 
and spleen. In particular, mouse TANGO 28 1 expression was detected in the following adult 
tissues: a moderate, ubiquitous signal was detected in the submandibular gland; a strong, 

30 ubiquitous signal was detected in the adrenal gland; a strong, multifocal signal was detected 
in the medulla of the thymus and a moderate, ubiquitous signal was detected in the cortex of 
the thymus; a strong signal was detected in the lymph node; a strong signal was detected in 
the follicles of the spleen; a weak signal was detected in the mucosal epithelium of the 
bladder; a strong signal was detected in the ovaries; a ubiquitous signal was detected in the 

35 placenta; a moderate signal was detected in the muscle region of the stomach; a weak signal 
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in a pattern outlining many of the large airways was detected in lung; a weak, ubiquitous 
signal was detected in the liver; and a weak, ubiquitous signal was detected in the kidney. 

In the case of embryonic expression, mouse TANGO 281 expression was detected in 
the lung, stomach, thymus and submaxillary gland. In particular, at El 6.5 a weak to moderate 
5 signal was detected in the intestine and stomach, and a moderate, ubiquitous signal was 
detected in the lung. At PL5, a signal was detected in the lung, stomach, thymus, and 
submaxillary gland. 

Figure 33 shows that there is an overall 66.5% identity between the precursor human 
TANGO 281 amino acid sequence and the precursor mouse TANGO 281 amino acid 
10 sequence. 

Clone EpT281 was deposited with the American Type Culture Collection (10801 
University Boulevard, Manassas, VA 20110-2209) on June 15, 1999 and assigned patent 
deposit Number PTA-224. This deposit will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes 
15 of Patent Procedure. This deposit was made merely as a convenience for those of skill in the 
art and is not an admission that a deposit is required under 35 U.S.C. §112. 

Uses of TANGO 281 Nucleic acids. Polypeptides, and Modulators Thereof 

As TANGO 281 was originally found in a megakaryocyte library, TANGO 281 

20 nucleic acids, proteins, and modulators thereof can be used to modulate the proliferation, 
differentiation, and/or function of megakaryocytes and platelets. TANGO 28 1 nucleic acids, 
proteins, and modulators thereof can be used to treat associated hematological diseases such 
as thrombocytopenia, platelet disorders and bleeding disorders (e.g., hemophilia). TANGO 
281 nucleic acids, proteins, and modulators thereof can be used to modulate platelet 

25 aggregation and degranulation. Further, as TANGO 28 1 expression varies in mouse embryos 
during development, TANGO 28 1 nucleic acids, proteins, and modulators thereof can be used 
to modulate the development of cells, tissues or organs in embryos. 

As TANGO 28 1 expression is greatly reduced in megakaryocytes obtained from gata- 
1 knockout mice compared normal mice, TANGO 281 is either a direct or indirect target of 

30 gata-1 and has profound biological implications. Gata-1 is a transcription factor involved in 
the development of hemapoietic cell lineages ~ gata-1 expression is required for proper 
development of erythocytes and megakaryocytes. Although deletion of the gata-1 gene is 
lethal at the embryonic stage due to a failure to form red blood cells, deletion of only the 
element of the gata- 1 gene responsible for megakaryocyte-speci fic expression (a 1 0 kb region 

35 of genomic DNA containing a megakaryocyte specific DNase I hypersensitive) is not lethal 
and results in a reduction in gata-1 expression in the megakaryocyte without affecting gata-1 
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expression in red blood cells. The megakaryocytes of mice with this element of the gata-1 
gene knocked out fail to develop into mature platelets, and the mice experience abnormal 
bleeding due to their profound thrombocytopenia. TANGO 281 nucleic acids, proteins, and 
modulators thereof can be used to treat disease and/or disorders associated with gata-1 
5 dysfunction. In light of the reduced expression of TANGO 281 in gata-1 knockout mice, 
TANGO 281 expression can be utilized as a marker for modulators of gata-1 expression 
and/or activity. 

As TANGO 28 1 is expressed in the heart, brain, spleen, lung, kidney, embryo and 
megakaryocytes, TANGO 28 1 nucleic acids, proteins, and modulators thereof can be used to 

1 0 treat disorders of these cells, tissues, or organs, e.g. , ischemic heart disease or atherosclerosis, 
head trauma, brain cancer, splenic lymphoma, splenomegaly, lung cancer, cystic fibrosis, 
rheumatoid lung disease, glomerulonephritis, end stage renal disease, uremia, DiGeorge 
syndrome, thymoma, autoimmune disorders, atresia, Crohns's disease, and various embryonic 
disorders. TANGO 281 nucleic acids, proteins, and modulators thereof can be used to 

1 5 modulate the bleeding associated with uremia. Further, TANGO 28 1 nucleic acids, proteins, 
and modulators thereof can be used to treat hypercoagulation associated with a damaged 
endothelium, e.g., pre-eclampsia, malignant hypertension, disseminated intravascular 
coagulopathy, renal transplant rejection, cyclosporin toxicity, microangiopathic hemolytic 
anemia, and thrombotic thrombocytopenic purpura. 

20 TANGO 28 1 exhibits homology to a gene referred to as "gene 3 1 M (PCT Publication 

No. W098/39446), which is expressed primarily in the brain and thymus. In light of this, 
TANGO 281 nucleic acids, proteins and modulators thereof can be utilized to ameliorate at 
least one symptom associated with central nervous (CNS) disorders, hematopoietic disorder, 
and disorders of the endocrine system. 

25 Further, in light of TANGO 28 l's pattern of expression in mice, TANGO 281 

expression can be utilized as a marker for specific tissues (e.g., lymphoid tissues such as the 
thymus and spleen) and/or cells (e.g. , lymphocytes) in which INTERCEPT 28 1 is expressed. 
TANGO 281 nucleic acids can also be utilized for chromosomal mapping. 

Tables 1-4 below provide a summary of the sequence information for TANGO 253, 

30 TANGO 257, INTERCEPT 258 and TANGO 28 1 . 

TABLE 1: Summary of Human TANGO 253, TANGO 257, INTERCEPT 258, and 
TANGO 281 Sequence Information 



35 



Gene 


cDNA 


ORF 


Figure 


Accession 
Number 


TANGO 253 


SEQDD NO:l 


SEQ ID NO:2 


Figure 1 


207222 


TANGO 257 


SEQIDNO:15 


SEQIDNO:16 


Figures 9A-9B 


207222 
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INTERCEPT 
258 


SEQ ID NO:26 


SEQ ID NO:27 


Figurel7 


207222 


TANGO 281 


SEQ ID NO:46 


SEQ ID NO:47 


Figures 27 


207222 
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10 



15 



20 



25 



30 



35 
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Cyioplasmic 






aa 225-246 
of 

SEQ ID NO:28 
(SEQIDNO:79) 


aa 149-245 
of 

SEQ ID NO:48 
(SEQ ID NO:53) 


Transmembrane 






aa 207-224 
of 

SEQ ID NO:28 

(SEQ ID NO:78); 

aa 247-271 of 
SEQ ID NO: 28 

(SEQ ID NO: 33) 


aa 124-148 
of 

SEQ ID NO:48 
(SEQ ID NO:52) 


Collagen 


aa 36-45 
of 

SEQ ID NO:3 (SEQ 
ID NO:6) 








Clq 


aa 102-232 
of 

SEQ ID NO:3 
(SEQ ID NO:7) 








OO 






aa 49-128; 

167-226 of 
SEQ ID 
NO:28 
(SEQ ID 
NO:35; 
SEQ ID 
NO:36) 




PSBH 








aa 41-90; 12- 
187 

of 

SEQ ID 
NO:48 

(SEQ ID 
NO:54; SEQ 
ID NO:55) 


Extracellular 






aa 30-206 of 
SEQ ID NO. 28 
(SEQ ID NO: 
76) 

aa 272-370 of 
SEQ ID NO: 28 
(SEQ ID NO: 
34) 


aa 39-123 
of 

SEQ IDNO:48 
(SEQEDNO:51) 


Mature Protein 


aa 16-243 
of 

SEQ ID NO:3 
(SEQ ID NO:4) 


aa 22-406 
of 

SEQ ID NO: 17 
(SEQ ID NO: 18) 


aa 30-370 
of 

SEQ ID NO:28 
(SEQ ID NO:29) 


aa 39-245 
of 

SEQ ID NO:48 
(SEQ ID NO:50) 


Signal 
Sequence 


aa 1-15 
of 

SEQ ID NO:3 
(SEQ ID NO:5) 


aa 1-21 
of 

SEQ ID NO: 17 
(SEQ ID NO: 19) 


aa 1-29 
of 

SEQ ID NO:28 
(SEQ IDNO:30) 


aa 1-38 
of 

SEQ ID NO:48 
(SEQ ED NO:49) 


Protein 


TANGO 253 


TANGO 257 


INTERCEPT 
258 


TANGO 281 
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TABLE 3: Summary of Mouse TANGO 253, TANGO 257, INTERCEPT 258 and 
TANGO 281 Sequence Information 



Gene 


cDNA 


ORE 


Figure 


Accession 
Number 


TANGO 253 


SEQIDNO:8 


SEQ ID NO:9 


Figures 3A-3B 


207215 


TANGO 257 


SEQIDNO:21 


SEQ ID NO:22 


Figures 11A-11B 


207217 


INTERCEPT 
258 


SEQ ID NO:37 


SEQ ID NO:38 


Figures 20A-20B 


207221 


TANGO 281 


3EQIDNO:56 


SEQ ID NO: 57 


Figures 31A-31B 


PTA-224 
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Cytoplasmic 






275-394 
of 

SEQ ID NO:39 
(SEQ ID NO:45) 


aa 138-213 
of 

SEQ ID NO:58 
(SEQ ID NO:63 


Transmembrane 






250 to 274 
of 

SEQ ID NO:39 
(SEQ ID NO:44) 


aa 113-137 
of 

SEQ ID NO:58 
(SEQ ID NO:62) 


Collagen 


aa 36-95 
of 

SEQ ID NO: 10 
(SEQ ID NO: 14) 








Clq 


aa 105-232 
of 

SEQ ID NO: 10 
(SEQ ID NO: 13) 








op 






aa 170-229 
of 

SEQ ID NO:39 

(SEQ ID 
NO:46) 




PSBH 








aa 42-91; 128-183 
of 

SEQ ID NO:58 

(SEQ ID NO:64; 
SEQIDNO:65) 


Extracellular 






30-249 
of 

SEQ ID NO:39 

(SEQ ID 
NO:83) 


aa 27-1 12 
of 

SEQIDNO:58 

(SEQ ID 
NO:61) 


Mature Protein 


aa 16-243 
of 

SEQ ID NO: 10 
(SEQ ID NO: 11) 


aa 22-406 
of 

SEQ ID NO:23 
(SEQ ID NO:24) 


aa 30-394 
of 

SEQ ID NO:39 
(SEQ ID NO:40) 


aa 27-213 
of 

SEQ ID NO:58 
(SEQ ID NO:60) 


Signal 
Sequence 


aa 1-15 
of 

SEQ ID NO: 10 
(SEQ CD 
NO: 12) 


aa 1-21 
of 

SEQ ID NO:23 
(SEQ ID 
NO:25) 


aa 1-29 

of 1 

SEQ ID NO:39 

(SEQ ID 
NO:41) 


aa 1-26 
of 

SEQ ID NO:58 

(SEQ ID 
i NO:59) 


Protein j 


TANGO 253 


TANGO 257 


INTERCEPT 
258 


TANGO 281 
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Various aspects of the invention are described in further detail in the following subsections: 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
a polypeptide of the invention or a biologically active portion thereof, as well as nucleic 
acid molecules sufficient for use as hybridization probes to identify nucleic acid molecules 
encoding a polypeptide of the invention and fragments of such nucleic acid molecules 
suitable for use as PCR primers for the amplification or mutation of nucleic acid 
molecules. As used herein, the term "nucleic acid molecule" is intended to include DNA 
molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs 
of the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule can 
be single-stranded or double-stranded, but preferably is double-stranded DNA. In one 
embodiment, the nucleic acid molecules of the invention comprise a contiguous open 
reading frame encoding a polypeptide of the invention. 

15 An "isolated" nucleic acid molecule is one which is separated from other nucleic 

acid molecules which are present in the natural source of the nucleic acid molecule. 
Preferably, an "isolated" nucleic acid molecule is free of sequences (preferably protein 
encoding sequences) which naturally flank the nucleic acid (i.e., sequences located at the 
5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the 

20 nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid 
molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of 
the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid 
molecule, such as a cDNA molecule, can be substantially free of other cellular material, or 

25 culture medium when produced by recombinant techniques, or substantially free of 

chemical precursors or other chemicals when chemically synthesized. As used herein, the 
term "isolated" when referring to a nucleic acid molecule does not include an isolated 
chromosome. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
30 having the nucleotide sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 
47, 56, 57, 77, 80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 
125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 
161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 
181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or a complement thereof, 
35 can be isolated using standard molecular biology techniques and the sequence information 
provided herein. Using all or a portion of the nucleic acid sequences of SEQ ID NO:l, 2, 
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8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 103, 105, 107, 109, 
111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 
147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 
174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 
5 or 192, as a hybridization probe, nucleic acid molecules of the invention can be isolated 
using standard hybridization and cloning techniques (e.g., as described in Sambrook et al., 
eds., Molecular Cloning: A Laboratory Manual, 2nd ed.. Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). 

A nucleic acid molecule of the invention can be amplified using cDNA, mRNA or 
10 genomic DNA as a template and appropriate oligonucleotide primers according to 

standard PCR amplification techniques. The nucleic acid so amplified can be cloned into 
an appropriate vector and characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to all or a portion of a nucleic acid molecule of the 
invention can be prepared by standard synthetic techniques, e.g., using an automated DNA 
* 5 synthesizer. 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the nucleotide 
sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 
91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 
133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 
167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 
185, 186, 187, 188, 189, 190, 191 or 192, or the nucleotide sequence of the cDNA insert 
of a clone deposited with the ATCC® as Accession number 207222, Accession Number 
207215, Accession number 207217, Accession Number 207221 or patent deposit Number 
PTA-224, or a portion thereof. A nucleic acid molecule which is complementary to a 
given nucleotide sequence is one which is sufficiently complementary to the given 
nucleotide sequence that it can hybridize to the given nucleotide sequence thereby forming 
a stable duplex. 

30 Moreover, a nucleic acid molecule of the invention can comprise only a portion of 

a nucleic acid sequence encoding a full length polypeptide of the invention for example, a 
fragment which can be used as a probe or primer or a fragment encoding a biologically 
active portion of a polypeptide of the invention. The nucleotide sequence determined from 
the cloning one gene allows for the generation of probes and primers designed for use in 

35 identifying and/or cloning homologues in other cell types, e.g., from other tissues, as well 
as homologues from other mammals. The probe/primer typically comprises substantially 
purified oligonucleotide. In one embodiment, the oligonucleotide comprises a region of 
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nucleotide sequence that hybridizes under stringent conditions to at least about 12, 
preferably 25, more preferably about 50, 75, 100, 125, 150, 175, 200, 250, 300, 350 or 400 
consecutive oligonucleotides of the sense or anti-sense sequence of SEQ ID NO:l, 2, 8, 9, 
15, 16,21,22, 26, 27,37, 38,46,47, 56,57, 77, 80,91, 100, 101, 103, 105, 107, 109, 1 1 1, 

5 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 
149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 
175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 
192, or the nucleotide sequence of the cDNA insert of a clone deposited with the ATCC® 
as Accession number 207222, Accession Number 207215, Accession Number 207217, 

1 0 Accession Number 20722 1 , or patent deposit Number PTA-224, or of a naturally 

occurring mutant of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 
77, 80,91, 100, 101, 103, 104, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 
165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 

15 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192. In another embodiment, the 

oligonucleotide comprises a region of nucleotide sequence that hybridizes under stringent 
conditions to at least 400, preferably 450, 500, 530, 550, 600, 700, 750, 800, 850, 900, 
1000, 1 100, 1200 or more consecutive oligonucleotides of the sense of antisense sequence 
of SED ID NO: 1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 

20 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 
137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 
169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 
187, 188, 189, 190, 191 or 192 , or the nucleotide sequence of the cDNA insert of a clone 
deposited with the ATCC® as Accession number 207222, Accession number 207215, 

25 Accession number 2072 1 7, Accession Number 20722 1 , or patent deposit Number PTA- 
224, or of a naturally occurring mutant of SEQ ID NO: 1, 2, 8, 9, 15, 16, 21, 22, 26, 27, 
37, 38, 46,47, 56, 57, 77, 80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 
157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 

30 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192. 

In a preferred embodiment, the oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent conditions to at least about 450, 
preferably about 500, 550, 600, 650, 700, 750, 800, 850, 900, 1000, 1 100 or 1300 
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:l, 103, 105, 
35 107 or 109, or a naturally occurring mutant of SEQ ID NO:l, 103, 105, 107, or 109. In 
another preferred embodiment, the oligonucleotide typically comprises a region of 



-66- 



WO 00/78808 



PCT/US00/16883 



nucleotide sequence that hybridizes under stringent conditions to at least about 12, 
preferably 25, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700 or 720 
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:2, 91, 100, 101 
or 80. 

5 In another preferred embodiment, the oligonucleotide typically comprises a region 

of nucleotide sequence that hybridizes under stringent conditions to at least about 540, 
preferably about 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1 100, 1200 or 1250 
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:8, 1 19, 121 , 
123 or 125, or of a naturally occurring mutant of SEQ ID NO:8, 1 19, 121, 123 or 125. In 

1 0 another preferred embodiment, the oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent conditions to at least about 310, 
preferably about 350, 400, 450, 500, 550, 600, 650 or 700 consecutive nucleotides of the 
sense or anti-sense sequence of SEQ ID NO:9, 174, 175, 176 or 177, or of a naturally 
occurring mutant of SEQ ID NO:9, 174, 175, 176 or 177. 

In another preferred embodiment, the oligonucleotide typically comprises a region 
of nucleotide sequence that hybridizes under stringent conditions to at least about 1800 
consecutive nucleotides of the sense or anti-sense sequence of SEQ ED NO: 15, 111,113, 
1 1 5 or 1 17, or of a naturally occurring mutant of SEQ ID NO: 1 5, 1 1 1 , 1 1 3, 1 1 5 or 1 1 7. In 
^ another preferred embodiment, the oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent conditions to at least about 1 150 
consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO: 16, 170, 171, 
172 or 173, or of a naturally occurring mutant of SEQ ID NO:16, 170,171, 172 or 173. 

In another preferred embodiment, the oligonucleotide typically comprises a region 
25 of nucleotide sequence that hybridizes under stringent conditions to at least about 1 100, 
preferably about 1200, 1300, 1400, 1500, 16500 or 1700 consecutive nucleotides of the 
sense or anti-sense sequence of SEQ ID NO:21, 127, 129, 131 or 133, or of a naturally 
occurring mutant of SEQ ID NO:21, 127, 129, 1 31 or 133. In another preferred 
embodiment, the oligonucleotide typically comprises a region of nucleotide sequence that 
30 hybridizes under stringent conditions to at least about 1 150 consecutive nucleotides of the 
sense or anti-sense sequence of SEQ ID NO:22, 1 78, 1 79, 1 80 or 1 8 1 , or of a naturally 
occurring mutant of SEQ ID NO:22, 178, 179, 180 or 181. 

In another preferred embodiment, the oligonucleotide typically comprises a region 
of nucleotide sequence that hybridizes under stringent conditions to at least about 420, 
35 preferably about 450, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 
1700 or 1800 consecutive nucleotides of the sense or anti-sense sequence of SEQ ID 
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NO:26, 135, 137, 139 or 141, or ofa naturally occurring mutant of SEQ ID NO:26, 135, 
1 37, 1 39 or 141 . In another preferred embodiment, the oligonucleotide typically 
comprises a region of nucleotide sequence that hybridizes under stringent conditions to at 
least about 12, preferably about 25, more preferably about 50, 100, 200, 300, 400, 500, 
5 600,700,800,900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 consecutive 
nucleotides of the sense or anti-sense sequence of SEQ ID NO:27, 182, 183, 184 or 185, 
or ofa naturally occurring mutant of SEQ ID NO:27, 182, 183, 184 or 185. 

In another preferred embodiment, the oligonucleotide typically comprises a region 
of nucleotide sequence that hybridizes under stringent conditions to at least about 675, 

10 preferably about 700, 800, 900, 1000, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 

consecutive nucleotides of the sense or anti-sense sequence of SEQ ED NO:37, 143, 145, 
147 or 149, or of a naturally occurring mutant of SEQ ID NO:37, 143, 145, 147 or 149. In 
another preferred embodiment, the oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent conditions to at least about 500, 

15 preferably about 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700 or 
1 800 consecutive nucleotides of the sense or anti-sense sequence of SEQ ID NO:38, 1 86, 
187, 188 or 189, or ofa naturally occurring mutant of SEQ ID NO:38, 186, 187, 188 or 
189. 

^ In another preferred embodiment, the oligonucleotide typically comprises a region 

of nucleotide sequence that hybridizes under stringent conditions to at least about 12, 
preferably about 25, more preferably about 50, 100, 200, 300, 400, 500, 600, 700, 800, 
900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700 or 1800 consecutive nucleotides of 
the sense or anti-sense sequence of SEQ ID NO:46, 151, 153, 155 or 157, or ofa naturally 
occurring mutant of SEQ ID NO:46, 151, 153, 155 or 157. In another preferred 
embodiment, the oligonucleotide typically comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least about 12, preferably about 25, more 
preferably about 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 
1400, 1500, 1600, 1700 or 1800 consecutive nucleotides of the sense or anti-sense 
sequence of SEQ ID NO:47, 190, 191, 192 or 77, or ofa naturally occurring mutant of 
SEQ ED NO:47, 190, 191, 192 or 77. 

In another preferred embodiment, the oligonucleotide typically comprises a region 
of nucleotide sequence that hybridizes under stringent conditions to at least about 550, 
preferably about 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700, 
35 1 800 or 1 850 consecutive nucleotides of the sense or anti-sense sequence of SEQ ID 
NO:56, 159, 161, 163 or 165, or of a naturally occurring mutant of SEQ DD NO:56, 159, 
161, 163 or 165. In another preferred embodiment, the oligonucleotide typically 
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comprises a region of nucleotide sequence that hybridizes under stringent conditions to at 
least about 12, preferably about 25, more preferably about 50, 100, 200, 300, 400, 500, 
600 or 700 consecutive nucleotides of the sense or anti-sense sequence of SEQ ED NO:57, 
166, 167, 168 or 169, or of a naturally occurring mutant of SEQ ID NO:57, 166, 167, 168 
5 or 169. 

Probes based on the sequence of a nucleic acid molecule of the invention can be 
used to detect transcripts or genomic sequences encoding the same protein molecule 
encoded by a selected nucleic acid molecule. The probe comprises a label group attached 
thereto, e.g., a radioisotope, a fluorescent compound, an enzyme, or an enzyme co- factor. 
10 Such probes can be used as part of a diagnostic test kit for identifying cells or tissues 
which mis-express the protein, such as by measuring levels of a nucleic acid molecule 
encoding the protein in a sample of cells from a subject, e.g., detecting mRNA levels or 
determining whether a gene encoding the protein has been mutated or deleted. 

A nucleic acid fragment encoding a biologically active portion of a polypeptide of 
the invention can be prepared by isolating a portion of any of SEQ ID NO:3, 10, 17, 23, 
28, 39, 48, 58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164 
expressing the encoded portion of the polypeptide protein (e.g., by recombinant expression 
^ in vitro) and assessing the activity of the encoded portion of the polypeptide. 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 
77, 80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 
129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 

25 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 
183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, or the nucleotide sequence of the 
cDNA insert of a clone deposited with the ATCC® as Accession Number 207222, 
Accession Number 207215, Accession Number 207217, Accession Number 207221 or 
patent deposit number PTA-224 due to degeneracy of the genetic code and thus encode the 

30 same protein as that encoded by the nucleotide sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 
21,22, 26, 27,37, 38, 46,47, 56, 57, 77, 80,91, 100, 101, 103, 105, 107, 109, 111, 113, 
115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 
151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 
176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191or 192, or 

35 the nucleotide sequence of the cDNA insert of a clone deposited with the ATCC® as 
Accession Number 207222, Accession Number 20721 5, Accession Number 207217, 
Accession Number 207221 or patent deposit Number PTA-224. 
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In addition to the nucleotide sequences of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 
27, 37, 38,46, 47, 56, 57, 77, 80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 
5 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191or 192, it will be 
appreciated by those skilled in the art that DNA sequence polymorphisms that lead to 
changes in the amino acid sequence may exist within a population (e.g., the human 
population). Such genetic polymorphisms may exist among individuals within a 
population due to natural allelic variation. 

10 An allele is one of a group of genes which occur alternatively at a given genetic 

locus. As used herein, the phrase "allelic variant" refers to a nucleotide sequence which 
occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. As used 
herein, the terms "gene" and "recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame encoding a polypeptide of the invention. Such natural 

*5 allelic variations can typically result in 1-5% variance in the nucleotide sequence of a 
given gene. Alternative alleles can be identified by sequencing the gene of interest in a 
number of different individuals. This can be readily carried out by using hybridization 
probes to identify the same genetic locus in a variety of individuals. Any and all such 
nucleotide variations and resulting amino acid polymorphisms or variations that are the 

20 result of natural allelic variation and that do not alter the functional activity are intended to 
be within the scope of the invention. 

The human gene for TANGO 253 has been mapped to the long arm of 
chromosome 11. Flanking markers for this region are Dl 151356 and Dl 15924. The 

^ Jacobsen syndrome (JBS), ED4 (ectodermal dysplasia 4), CMT4B (Charcot Marie Tooth 
neuropathy), PORC (porphyria, acute) loci also map to this region of the human 
chromosome. The APOPLP1 (apolipoprotein cluster), DRD2 (dopamine receptor 2), 
PGL1 (paraganglioma glomus tumors), RDX (radixin), NCAM1 (neural cell adhesion 
molecule), ARCN1 (archain 1), and IL-10R (DL-10 receptor) genes map to this region of 

^ the human chromosome. This region is syntenic to mouse chromosome 9. The ruf (rough 
fur), lu (luxoid), and atm (ataxia telangiectasia gene mutated in human being) loci also 
mpa to this region of the mouse chromosome. The ruf (rough fur), lu (luxoi), hmbs 
(hydroxymethylbilane synthase), IL-lORa (IL-10 receptor a), and drd2 (dopamine 
receptor 2) genes also map to this region of the mouse chromosome. 

35 The human gene for TANGO 257 has been mapped to chromosome 1 . Flanking 

markers for ths region are WI-7614 and FB14F9. The WS2B (Waardenburg syndrome) 
loci also maps to this region of the human chromosome. The NGF-P (nerve growth 
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factor-P), TSHB (thyroid stimulating hormone), and GSTM1 (glutathione S-transferase 
cluster) genes also map to this region of the human chromosome. This region is syntenic 
to mouse chromosome 3. The de (droopy ear) loci maps to this region of the mouse 
chromosome. The NGF-p (nerve growth factor-p), TSHB (thyroid stimulating hormone), 
5 and BCAN (brevican) genes also map to this region of the mouse chromosome. 

The human gene for INTERCEPT 258 has been mapped to the long arm of 
chromosome 1 1, in the region q23. Flanking markers for this region are Dl 1S936 and 
Dl 1S933. The CMT4B (Charcot Marie Tooth neuropathy), ED4 (ecotodermal dysplasia), 
JBS (Jacobsen Syndrome), and TCPT (thrombocytopenia) loci also map to this region of 

10 the human chromosome. The APOLP1 (apolipoprotein cluster), DRD2 (dopamine 
receptor), and RDX (radixin) genes also map to this region of the human chromosome. 
This region is syntenic to mouse chromosome 9. The atm (ataxia telangiectasia), ruf 
(rough fur), and vs (variable spotting) loci map to this region of the mouse chromosome. 
The lu (luxoid), vs (variable spotting), atm (ataxia telangiectasia), rug (rough fur), and 

15 lapl (leucine arylaminopeptidase) genes also map to this region of the mouse 
chromosome. 

Moreover, nucleic acid molecules encoding proteins of the invention from other 
species (homologies), which have a nucleotide sequence which differs from that of the 
^ human or mouse protein described herein are intended to be within the scope of the 
invention. Nucleic acid molecules corresponding to natural allelic variants and 
homologues of a cDNA of the invention can be isolated based on their identity to the 
human nucleic acid molecule disclosed herein using the human cDNAs, or a portion 
thereof, as a hybridization probe according to standard hybridization techniques under 
stringent hybridization conditions. For example, a cDNA encoding a soluble form of a 
membrane-bound protein of the invention isolated based on its hybridization to a nucleic 
acid molecule encoding all or part of the membrane-bound form. Likewise, a cDNA 
encoding a membrane-bound form can be isolated based on its hybridization to a nucleic 
acid molecule encoding all or part of the soluble form. 

30 Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 1000, 1 100, 1200 or 
1300 contiguous nucleotides in length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence, preferably the coding 
sequence, of SEQ ID NO:l, 103, 105, 107 or 109, or a complement thereof. 

35 Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700 or 
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720 contiguous nucleotides in length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence, preferably the coding 
sequence, of SEQ ID NO:2, 80, 91, 100 or 101, or a complement thereof 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
5 invention is at least 540, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1 100, 1200 or 1250 
contiguous nucleotides in length and hybridizes under stringent conditions to the nucleic 
acid molecule comprising the nucleotide sequence, preferably the coding sequence, of 
SEQ ID NO:8, 119, 121, 123 or 125, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 310, 350, 400, 450, 500, 550, 600, 650 or 700 contiguous nucleotides 
in length and hybridizes under stringent conditions to the nucleic acid molecule 
comprising the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:9, 
174, 175, 176 or 177, or a complement thereof 

15 Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 1800 contiguous nucleotides in length and hybridizes under stringent 
conditions to the nucleic acid molecule comprising the nucleotide sequence, preferably the 
coding sequence, of SEQ ID NO: 15, 111, 113, 115 or 117, or a complement thereof 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
20 invention is at least 1 150 or 1200 contiguous nucleotides in length and hybridizes under 
stringent conditions to the nucleic acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID NO: 16, 170, 171, 172 or 173, or a complement 
thereof 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

25 

invention is at least 1100, 1200, 1300, 1400, 1500, 1600 or 1700 contiguous nucleotides 
in length and hybridizes under stringent conditions to the nucleic acid molecule 
comprising the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:21, 
127, 129, 131 or 133, or a complement thereof 

3Q Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 1 150 or 1200 contiguous nucleotides in length and hybridizes under 
stringent conditions to the nucleic acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID NO:22, 178, 179, 180 or 181, or a complement 
thereof 

35 Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 420, 450, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 
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1500, 1600, 1700, or 1800 contiguous nucleotides in length and hybridizes under stringent 
conditions to the nucleic acid molecule comprising the nucleotide sequence, preferably the 
coding sequence, of SEQ ED No:26, 135, 137, 139 or 141, or a complement thereof 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
5 invention is at least 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 
1300, 1400, 1500, 1600, 1700 or 1800 contiguous nucleotides in length and hybridizes 
under stringent conditions to the nucleic acid molecule comprising the nucleotide 
sequence, preferably the coding sequence, of SEQ ID NO:27, 182, 183, 1 84 or 1 85, or a 
complement thereof. 

10 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 675, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 1700 
or 1800 contiguous nucleotides in length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence, preferably the coding 
sequence, of SEQ ID NO:37, 143, 145, 147 or 149, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 
1700 or 1800 contiguous nucleotides in length and hybridizes under stringent conditions 
to the nucleic acid molecule comprising the nucleotide sequence, preferably the coding 
20 sequence, of SEQ ID NO:38, 186, 187, 188 or 189, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1 100, 1200, 
1300, 1400, 1500, 1600, 1700 or 1800 contiguous nucleotides in length and hybridizes 
under stringent conditions to the nucleic acid molecule comprising the nucleotide 
25 sequence, preferably the coding sequence, of SEQ ID NO:46, 1 5 1 , 1 53, 1 55 or 1 57, or a 
complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 50, 100, 200, 300, 400, 500, 600, 700 or 750 contiguous nucleotides in 
length and hybridizes under stringent conditions to the nucleic acid molecule comprising 

30 

the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:47, 77 190, 191 
or 1 92, or a complement thereof. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 550, 600, 700, 800, 900, 1000, 1 100, 1200, 1300, 1400, 1500, 1600, 
35 1700, 1800 or 1850 contiguous nucleotides in length and hybridizes under stringent 

conditions to the nucleic acid molecule comprising the nucleotide sequence, preferably the 
coding sequence, of SEQ ID NO:56, 159, 161, 163 or 165, or a complement thereof. 
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Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 50, 100, 200, 300, 400, 500, 600 or 700 contiguous nucleotides in 
length and hybridizes under stringent conditions to the nucleic acid molecule comprising 
the nucleotide sequence, preferably the coding sequence, of SEQ ID NO:57, 166, 167, 168 
or 169, or a complement thereof. 

As used herein, the term "hybridizes under stringent conditions" is intended to 
describe conditions for hybridization and washing under which nucleotide sequences at 
least 60%, 65%, 70%, preferably 75%, identical to each other typically remain hybridized 
to each other. Such stringent conditions are known to those skilled in the art and can be 
found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization conditions are 
hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45° C, followed by one 
or more washes in 0.2 X SSC, 0.1% SDS at 50-65° C. Preferably, an isolated nucleic acid 
molecule of the invention that hybridizes under stringent conditions to the sequence of 
15 SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 91, 100, 101, 
103, 105, 107, 109, ill, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 
139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 
170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 
188, 189, 190, 191 or 192 , or a complement thereof, corresponds to a naturally-occurring 
20 nucleic acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule 
refers to an RNA or DNA molecule having a nucleotide sequence that occurs in nature 
(e.g., encodes a natural protein). 

In addition to naturally-occurring allelic variants of a nucleic acid molecule of the 
invention sequence that may exist in the population, the skilled artisan will further 
appreciate that changes can be introduced by mutation thereby leading to changes in the 
amino acid sequence of the encoded protein, without altering the biological activity of the 
protein. For example, one can make nucleotide substitutions leading to amino acid 
substitutions at "non-essential" amino acid residues. A "non-essential" amino acid residue 
is a residue that can be altered from the wild-type sequence without altering the biological 
activity, whereas an "essential" amino acid residue is required for biological activity. For 
example, amino acid residues that are not conserved or only semi-conserved among 
homologues of various species may be non-essential for activity and thus would be likely 
targets for alteration. Specific examples of conservative amino acid alterations from the 
original amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48 or 58 are shown in 
SEQ ID NO: 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 
132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164. 



25 



30 



35 



-74- 



WO 00/78808 



PCT/US00/16883 



Alternatively, amino acid residues that are conserved among the homologues of various 
species {e.g., mouse and human) may be essential for activity and thus would not be likely 
targets for alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
5 encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO:3, 102, 104, 106 or 108, yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 40%, 45%, 50%, 55%, 60%, 65%, 
10 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID NO:3, 102, 104, 
106 or 108. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO:10, 118, 120, 122 or 124 yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 95%, or 98% identical to the amino 
acid sequence of SEQ ID NO:10, 118, 120, 122 or 124. 

20 Accordingly, another aspect of the invention pertains to nucleic acid molecules 

encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO:17, 1 10, 1 12, 1 14 or 1 16 yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 

25 includes an amino acid sequence that is at least about 88%, 90%, 95% or 98% identical to 
the amino acid sequence of SEQ ID NO: 17, 1 10, 1 12, 1 14 or 1 16. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
30 ID NO:23, 126, 128, 130 or 132 yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 88%, 90%, 95%, or 98% identical to 
the amino acid sequence of SEQ ID NO:23, 126, 128, 130 or 132. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 

35 

encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
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ID NO:28, 134, 136, 138, 140, yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 45%, 50%, 55%, 60%, 65%, 75%, 
85%, 95%, or 98% identical to the amino acid sequence of SEQ ID NO:28, 134, 136, 138, 
5 140. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO:39, 142, 144, 146 or 148, yet retain biological activity. In one embodiment, the 
1 0 isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 45%, 50%, 55%, 60%, 65%, 75%, 
85%, 95%, or 98% identical to the amino acid sequence of SEQ ID NO:39, 142, 144, 146 
or 148. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO:48, 150, 152, 154, or 156, yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 30%, 35%, 40%, 45%, 50%, 55%, 

20 

60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID 
NO:48, 150, 152, 154 or 156. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 

25 are not essential for activity. Such polypeptides differ in amino acid sequence from SEQ 
ID NO:58, 158, 160, 162 or 164, yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein that 
includes an amino acid sequence that is at least about 30%, 35%, 40%, 45%, 50%, 55%, 
60%, 65%, 75%, 85%, 95%, or 98% identical to the amino acid sequence of SEQ ID 

30 NO:58, 158, 160, 162 or 164. 

An isolated nucleic acid molecule encoding a variant protein can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the nucleotide 
sequence of SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56, 57, 77, 80, 
91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 
35 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 166, 
167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 
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185, 186, 187, 188, 189, 190, 191 or 192 such that one or more amino acid substitutions, 
additions or deletions are introduced into the encoded protein. Mutations can be 
introduced by standard techniques, such as site-directed mutagenesis and PCR-mediated 
mutagenesis. Preferably, conservative amino acid substitutions are made at one or more 

5 predicted non-essential amino acid residues. A "conservative amino acid substitution" is 
one in which the amino acid residue is replaced with an amino acid residue having a 
similar side chain. Families of amino acid residues having similar side chains have been 
defined in the art. These families include amino acids with basic side chains (e.g., lysine, 
arginine, histidine), acidic side chains (eg., aspartic acid, glutamic acid), uncharged polar 

10 side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), 
nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, 
methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and 
aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Alternatively, 
mutations can be introduced randomly along all or part of the coding sequence, such as by 

15 saturation mutagenesis, and the resultant mutants can be screened for biological activity to 
identify mutants that retain activity. Following mutagenesis, the encoded protein can be 
expressed recombinantly and the activity of the protein can be determined. 

In a preferred embodiment, a mutant polypeptide that is a variant of a polypeptide 
of the invention can be assayed for: (1) the ability to form protein: protein interactions 
20 with proteins in a signaling pathway of the polypeptide of the invention; (2) the ability to 
bind a ligand of the polypeptide of the invention; or (3) the ability to bind to an 
intracellular target protein of the polypeptide of the invention. In yet another preferred 
embodiment, the mutant polypeptide can be assayed for the ability to modulate cellular 
proliferation, cellular migration or chemotaxis, or cellular differentiation. 

25 

The present invention encompasses antisense nucleic acid molecules, i.e., 
molecules which are complementary to a sense nucleic acid encoding a polypeptide of the 
invention, e.g., complementary to the coding strand of a double-stranded cDNA molecule 
or complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can 

^ hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be complementary 
to an entire coding strand, or to only a portion thereof, e.g., all or part of the protein 
coding region (or open reading frame). An antisense nucleic acid molecule can be 
antisense to all or part of a non-coding region of the coding strand of a nucleotide 
sequence encoding a polypeptide of the invention. The non-coding regions ("5' and 3* 

^ untranslated regions") are the 5' and 3' sequences which flank the coding region and are 
not translated into amino acids. 
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An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 
40, 45 or 50 nucleotides or more in length. An antisense nucleic acid of the invention can 
be constructed using chemical synthesis and enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense 

5 oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the antisense 
and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted 
nucleotides can be used. Examples of modified nucleotides which can be used to generate 

10 the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 

5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 
5-carboxymethylaminomethyI-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 

15 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5 f -methoxycarboxymethyluracil, 5-methoxyuracil, 

2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 

20 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 
2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (/„&, RNA transcribed from the inserted nucleic acid will be of an antisense 

25 orientation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered to 
a subject or generated in situ such that they hybridize with or bind to cellular mRNA , 
and/or genomic DNA encoding a selected polypeptide of the invention to thereby inhibit 

30 expression, e.g., by inhibiting transcription and/or translation. The hybridization can be 
by conventional nucleotide complementarity to form a stable duplex, or, for example, in 
the case of an antisense nucleic acid molecule which binds to DNA duplexes, through 
specific interactions in the major groove of the double helix. An example of a route of 
administration of antisense nucleic acid molecules of the invention includes direct 

35 injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified 
to target selected cells and then administered systemically. For example, for systemic 
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administration, antisense molecules can be modified such that they specifically bind to 
receptors or antigens expressed on a selected cell surface, e.g., by linking the antisense 
nucleic acid molecules to peptides or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also be delivered to cells using the 
5 vectors described herein. To achieve sufficient intracellular concentrations of the 

antisense molecules, vector constructs in which the antisense nucleic acid molecule is 
placed under the control of a strong pol II or pol III promoter are preferred. 

An antisense nucleic acid molecule of the invention can be an *-anomeric (alpha) 
nucleic acid molecule. An «-anomeric nucleic acid molecule forms specific 
10 double-stranded hybrids with complementary RNA in which, contrary to the usual P-units, 
the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res. 
1 5:6625-6641). The antisense nucleic acid molecule can also comprise a 
2'-o-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inoue et al. (1987) FEBSLett. 215:327-330). 

The invention also encompasses ribozymes. Ribozymes are catalytic RNA 
molecules with ribonuclease activity which are capable of cleaving a single-stranded 
nucleic acid, such as an mRNA, to which they have a complementary region. Thus, 
ribozymes {e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave mRNA transcripts to thereby 
inhibit translation of the protein encoded by the mRNA. A ribozyme having specificity 
for a nucleic acid molecule encoding a polypeptide of the invention can be designed based 
upon the nucleotide sequence of a cDNA disclosed herein. For example, a derivative of a 
Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the 
active site is complementary to the nucleotide sequence to be cleaved in a Cech et al. U.S. 
Patent No. 4,987,071; and Cech et al. U.S. Patent No. 5,1 16,742. Alternatively, an mRNA 
encoding a polypeptide of the invention can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel and 
Szostak (1993) Science 261:141 1-1418. 

30 The invention also encompasses nucleic acid molecules which form triple helical 

structures. For example, expression of a polypeptide of the invention can be inhibited by 
targeting nucleotide sequences complementary to the regulatory region of the gene 
encoding the polypeptide (e.g., the promoter and/or enhancer) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally Helene 

35 (1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. NY. Acad. Sci. 660:27-36; 
and Maher (1992) Bioassays 14(12):807-15. 



20 
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In various embodiments, the nucleic acid molecules of the invention can be 
modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the 
stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic acids 

5 (see Hyrup et al. (1996) Bioorganic & Medicinal Chemistry 4(1): 5-23). As used herein, 
the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA 
mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are retained. The neutral backbone of 
PNAs has been shown to allow for specific hybridization to DNA and RNA under 

10 conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996), supra; 
Peny-O'Keefe et al. (1996) Proc. Natl. Acad. ScL USA 93: 14670-675. 

PNAs can be used in therapeutic and diagnostic applications. For example, PNAs 
can be used as antisense or antigene agents for sequence-specific modulation of gene 

^ expression by, e.g., inducing transcription or translation arrest or inhibiting replication. 
PNAs can also be used, e.g., in the analysis of single base pair mutations in a gene by, e.g., 
PNA directed PCR clamping; as artificial restriction enzymes when used in combination 
with other enzymes, e.g., SI nucleases (Hyrup (1996), supra; or as probes or primers for 
DNA sequence and hybridization (Hyrup (1996), supra; Perry-O'Keefe et al. (1996) Proc. 

20 Natl. Acad. Sci. USA 93: 14670-675). 

In another embodiment, PNAs can be modified, e.g., to enhance their stability or 
cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of 
PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery 

^ known in the art. For example, PNA-DNA chimeras can be generated which may 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g., RNAse H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 

^ of base stacking, number of bonds between the nucleobases, and orientation (Hyrup 
(1996), supra). The synthesis of PNA-DNA chimeras can be performed as described in 
Hyrup (1 996), supra, and Finn et al. (1996) Nucleic Acids Res. 24(17):3357-63. For 
example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified nucleoside analogs. Compounds such 
as 5 , -(4-methoxytrityl)amino-5 , -deoxy-thymidine phosphoramidite can be used as a link 
between the PNA and the 5* end of DNA (Mag et al. (1989) Nucleic Acids Res. 
17:5973-88). PNA monomers are then coupled in a stepwise manner to produce a 
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chimeric molecule with a 5' PNA segment and a 3* DNA segment (Finn et al. (1996) 
Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be 
synthesized with a 5' DNA segment and a 3' PNA segment (Peterser et al. (1975) 
Bioorganic Med. Chem. Lett. 5:1119-11 124). 

^ In other embodiments, the oligonucleotide may include other appended groups 

such as peptides (e.g., for targeting host cell receptors in vivo ), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. 
USA 86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652; PCT 
Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. 

10 W0 89/10134). In addition, oligonucleotides can be modified with hybridization-triggered 
cleavage agents (see, e.g., Krol et al. (1988) Bio/Techniques 6:958-976) or intercalating 
agents (see, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide 
may be conjugated to another molecule, e.g., a peptide, hybridization triggered 
cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc. 



H Isolated Proteins and Antibodies 

One aspect of the invention pertains to isolated proteins, and biologically active 
portions thereof, as well as polypeptide fragments suitable for use as immunogens to raise 

20 antibodies directed against a polypeptide of the invention. In one embodiment, the native 
polypeptide can be isolated from cells or tissue sources by an appropriate purification 
scheme using standard protein purification techniques. In another embodiment, 
polypeptides of the invention are produced by recombinant DNA techniques. Alternative 
to recombinant expression, a polypeptide of the invention can be synthesized chemically 

25 using standard peptide synthesis techniques. 

An "isolated" or "purified" protein or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or substantially free of chemical 
precursors or other chemicals when chemically synthesized. The language "substantially 

30 

free of cellular material" includes preparations of protein in which the protein is separated 
from cellular components of the cells from which it is isolated or recombinantly produced. 
Thus, protein that is substantially free of cellular material includes preparations of protein 
having less than about 30%, 20%, 10%, or 5% (by dry weight) of heterologous protein 
(also referred to herein as a "contaminating protein"). When the protein or biologically 

35 

active portion thereof is recombinantly produced, it is also preferably substantially free of 
culture medium, i.e., culture medium represents less than about 20%, 10%, or 5% of the 
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volume of the protein preparation. When the protein is produced by chemical synthesis, it 
is preferably substantially free of chemical precursors or other chemicals, i.e., it is 
separated from chemical precursors or other chemicals which are involved in the synthesis 
of the protein. Accordingly such preparations of the protein have less than about 30%, 
5 20%, 10%, 5% (by dry weight) of chemical precursors or compounds other than the 
polypeptide of interest. 

Biologically active portions of a polypeptide of the invention include polypeptides 
comprising amino acid sequences sufficiently identical to or derived from the amino acid 
sequence of the protein (e.g., the amino acid sequence shown in any of SEQ ID NO:4, 6, 

10 7, 13, 14, 18, 23, 28, 33, 34, 35, 36, 39, 42, 44, 45, 48, 51, 52, 53, 54, 55, 58, 61, 62, 63, 
64, 65, 71, 76, 34, 78, 79, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 
or 99 which include fewer amino acids than the full length protein, and exhibit at least one 
activity of the corresponding full-length protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of the corresponding protein. A 

' 5 biologically active portion of a protein of the invention can be a polypeptide which is, for 
example, 10, 25, 50, 100 or more amino acids in length. Moreover, other biologically 
active portions, in which other regions of the protein are deleted, can be prepared by 
recombinant techniques and evaluated for one or more of the functional activities of the 
native form of a polypeptide of the invention. 

20 

Preferred polypeptides have the amino acid sequence of SEQ ID NO:4, 6, 7, 13, 
14, 18, 23, 28, 33, 34, 35, 36, 39, 42, 44, 45, 48, 51, 52, 53, 54, 55, 58, 61, 62, 63, 64, 65, 
71, 76, 34, 78, 79, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, or 99. 
Other useful proteins are substantially identical (e.g., at least about 45%, preferably 55%, 
65%, 75%, 85%, 95%, or 99%) to any of SEQ ID NO:4, 6, 7, 13, 14, 18, 23, 28, 33, 34, 
35, 36, 39, 42, 44, 45, 48, 51, 52, 53, 54, 55, 58, 61, 62, 63, 64, 65, 71, 76, 34, 78, 79, 81, 
82, 83, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, or 99 and retain the functional 
activity of the protein of the corresponding naturally-occurring protein yet differ in amino 
acid sequence due to natural allelic variation or mutagenesis. 

30 To determine the percent identity of two amino acid sequences or of two nucleic 

acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence). The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then 

35 compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 
molecules are identical at that position. The percent identity between the two sequences is 
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a function of the number of identical positions shared by the sequences (i.e., % identity = 
# of identical positions/total # of positions (e.g., overlapping positions) x 100). In one 
embodiment, the two sequences are the same length. 

The determination of percent identity between two sequences can be accomplished 

^ using a mathematical algorithm. A preferred, non-limiting example of a mathematical 
algorithm utilized for the comparison of two sequences is the algorithm of Karlin and 
Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in Karlin and 
Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm is 
incorporated into the NBLAST and XBLAST programs of Altschul, et al. (1990) J. MoL 

10 Biol. 21 5:403-410. BLAST nucleotide searches can be performed with the NBLAST 
program, score = 100, wordlength = 12 to obtain nucleotide sequences homologous to a 
nucleic acid molecules of the invention. BLAST protein searches can be performed with 
the XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences 
homologous to a protein molecules of the invention. To obtain gapped alignments for 

15 comparison purposes, Gapped BLAST can be utilized as described in Altschul et al. 

(1997) Nucleic Acids Res. 25:3389-3402. Alternatively, PSI-Blast can be used to perform 
an iterated search which detects distant relationships between molecules (Id.). When 
utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the default parameters of the 
respective programs (e.g., XBLAST and NBLAST) can be used. See 

2® http://www.ncbi.nlm.nih.gov. 

Another preferred, non-limiting example of a mathematical algorithm utilized for 
the comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989). Such 
an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the 

^ CGC sequence alignment software package. When utilizing the ALIGN program for 

comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of 
12, and a gap penalty of 4 can be used. Additional algorithms for sequence analysis are 
known in the art and include ADVANCE and ADAM as described in Torellis and Robotti 
(1994) Comput. Appl. Biosci., 70:3-5; and FASTA described in Pearson and Lipman 

3q (1988) Proc. Natl. Acad. Sci. 55:2444-8. Within FASTA, ktup is a control option that sets 
the sensitivity and speed of the search. If ktup=2, similar regions in the two sequences 
being compared are found by looking at pairs of aligned residues; if ktup=l, single aligned 
amino acids are examined, ktup can be set to 2 or 1 for protein sequences, or from 1 to 6 
for DNA sequences. The default if ktup is not specified is 2 for proteins and 6 for DNA. 

^ For a further description of FASTA parameters, see 

http://bioweb.pasteur.fr/docs/man/man/fasta. 1 .html#sect2, the contents of which are 
incorporated herein by reference. 
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The percent identity between two sequences can be determined using techniques 
similar to those described above, with or without allowing gaps. In calculating percent 
identity, only exact matches are counted. 

The invention also provides chimeric or fusion proteins. As used herein, a 
5 "chimeric protein" or "fusion protein" comprises all or part (preferably biologically active) 
of a polypeptide of the invention operably linked to a heterologous polypeptide (i.e., a 
polypeptide other than the same polypeptide of the invention). Within the fusion protein, 
the term "operably linked" is intended to indicate that the polypeptide of the invention and 
the heterologous polypeptide are fused in-frame to each other. The heterologous 
1 0 polypeptide can be fused to the N-terminus or C-terminus of the polypeptide of the 
invention. 

One useful fusion protein is a GST fusion protein in which the polypeptide of the 
invention is fused to the C-terminus of GST sequences. Such fusion proteins can facilitate 
the purification of a recombinant polypeptide of the invention. 

In another embodiment, the fusion protein contains a heterologous signal sequence 
at its N-terminus. For example, the native signal sequence of a polypeptide of the 
invention can be removed and replaced with a signal sequence from another protein. For 
example, the gp67 secretory sequence of the baculovirus envelope protein can be used as a 

20 heterologous signal sequence {Current Protocols in Molecular Biology, Ausubel et al., 
eds., John Wiley & Sons, 1992). Other examples of eukaryotic heterologous signal 
sequences include the secretory sequences of melittin and human placental alkaline 
phosphatase (Stratagene; La Jolla, California). In yet another example, useful prokaryotic 
heterologous signal sequences include the phoA secretory signal (Sambrook et al., supra) 

25 and the protein A secretory signal (Pharmacia Biotech; Piscataway, New Jersey). 

In yet another embodiment, the fusion protein is an immunoglobulin fusion protein 
in which all or part of a polypeptide of the invention is fused to sequences derived from a 
member of the immunoglobulin protein family. The immunoglobulin fusion proteins of 
the invention can be incorporated into pharmaceutical compositions and administered to a 

30 subject to inhibit an interaction between a ligand (soluble or membrane-bound) and a 
protein on the surface of a cell (receptor), to thereby suppress signal transduction in vivo. 
The immunoglobulin fusion protein can be used to affect the bioavailability of a cognate 
ligand of a polypeptide of the invention. Inhibition of ligand/receptor interaction may be 
useful therapeutically, both for treating proliferative and difTerentiative disorders and for 

35 modulating {e.g., promoting or inhibiting) cell survival. Moreover, the immunoglobulin 
fusion proteins of the invention can be used as immunogens to produce antibodies directed 
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15 



against a polypeptide of the invention in a subject, to purify ligands and in screening 
assays to identify molecules which inhibit the interaction of receptors with ligands. 

Chimeric and fusion proteins of the invention can be produced by standard 
recombinant DNA techniques. In another embodiment, the fusion gene can be synthesized 

^ by conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers which give rise to 
complementary overhangs between two consecutive gene fragments which can 
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, e.g., 
Ausubel et al., supra). Moreover, many expression vectors are commercially available 

10 that already encode a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a 
polypeptide of the invention can be cloned into such an expression vector such that the 
fusion moiety is linked in- frame to the polypeptide of the invention. 

A signal sequence of a polypeptide of the invention (SEQ ID NO:5, 12, 19, 25, 30, 
41, 49 or 59) can be used to facilitate secretion and isolation of the secreted protein or 
other proteins of interest. Signal sequences are typically characterized by a core of 
hydrophobic amino acids which are generally cleaved from the mature protein during 
secretion in one or more cleavage events. Such signal peptides contain processing sites 
that allow cleavage of the signal sequence from the mature proteins as they pass through 
the secretory pathway. Thus, the invention pertains to the described polypeptides having a 
signal sequence, as well as to the signal sequence itself and to the polypeptide in the 
absence of the signal sequence (i.e., the cleavage products). In one embodiment, a nucleic 
acid sequence encoding a signal sequence of the invention can be operably linked in an 
expression vector to a protein of interest, such as a protein which is ordinarily not secreted 
or is otherwise difficult to isolate. The signal sequence directs secretion of the protein, 
such as from a eukaryotic host into which the expression vector is transformed, and the 
signal sequence is subsequently or concurrently cleaved. The protein can then be readily 
purified from the extracellular medium by art recognized methods. Alternatively, the 
signal sequence can be linked to the protein of interest using a sequence which facilitates 
purification, such as with a GST domain. 

In another embodiment, the signal sequences of the present invention can be used 
to identify regulatory sequences, e.g., promoters, enhancers, repressors. Since signal 
sequences are the most amino-terminal sequences of a peptide, it is expected that the 
nucleic acids which flank the signal sequence on its amino-terminal side will be regulatory 
35 sequences which affect transcription. Thus, a nucleotide sequence which encodes all or a 
portion of a signal sequence can be used as a probe to identify and isolate signal sequences 
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and their flanking regions, and these flanking regions can be studied to identify regulatory 
elements therein. 

The present invention also pertains to variants of the polypeptides of the invention. 
Such variants have an altered amino acid sequence which can function as either agonists 

^ (mimetics) or as antagonists. Variants can be generated by mutagenesis, e.g., discrete 
point mutation or truncation. An agonist can retain substantially the same, or a subset, of 
the biological activities of the naturally occurring form of the protein. An antagonist of a 
protein can inhibit one or more of the activities of the naturally occurring form of the 
protein by, for example, competitively binding to a downstream or upstream member of a 

1 0 cellular signaling cascade which includes the protein of interest. Thus, specific biological 
effects can be elicited by treatment with a variant of limited function. Treatment of a 
subject with a variant having a subset of the biological activities of the naturally occurring 
form of the protein can have fewer side effects in a subject relative to treatment with the 
naturally occurring form of the protein. 

15 

Variants of a protein of the invention which function as either agonists (mimetics) 
or as antagonists can be identified by screening combinatorial libraries of mutants, e.g., 
truncation mutants, of the protein of the invention for agonist or antagonist activity. In 
one embodiment/a variegated library of variants is generated by combinatorial 
^ mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A 
variegated library of variants can be produced by, for example, enzymatically ligating a 
mixture of synthetic oligonucleotides into gene sequences such that a degenerate set of 
potential protein sequences is expressible as individual polypeptides, or alternatively, as a 
set of larger fusion proteins (e.g., for phage display). There are a variety of methods 
which can be used to produce libraries of potential variants of the polypeptides of the 
invention from a degenerate oligonucleotide sequence. Methods for synthesizing 
degenerate oligonucleotides are known in the art (see t e.g., Narang (1983) Tetrahedron 
39:3; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 
198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477). 

3q In addition, libraries of fragments of the coding sequence of a polypeptide of the 

invention can be used to generate a variegated population of polypeptides for screening 
and subsequent selection of variants. For example, a library of coding sequence fragments 
can be generated by treating a double stranded PCR fragment of the coding sequence of 
interest with a nuclease under conditions wherein nicking occurs only about once per 

35 molecule, denaturing the double stranded DNA, renaturing the DNA to form double 
stranded DNA which can include sense/antisense pairs from different nicked products, 
removing single stranded portions from reformed duplexes by treatment with S 1 nuclease, 
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and ligating the resulting fragment library into an expression vector. By this method, an 
expression library can be derived which encodes N-terminal and internal fragments of 
various sizes of the protein of interest. 

Several techniques are known in the art for screening gene products of 
^ combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. The most widely used techniques, 
which are amenable to high through-put analysis, for screening large gene libraries 
typically include cloning the gene library into replicable expression vectors, transforming 
appropriate cells with the resulting library of vectors, and expressing the combinatorial 
10 genes under conditions in which detection of a desired activity facilitates isolation of the 
vector encoding the gene whose product was detected. Recursive ensemble mutagenesis 
(REM), a technique which enhances the frequency of functional mutants in the libraries, 
can be used in combination with the screening assays to identify variants of a protein of 
the invention (Arkin and Yourvan (1992) Proc. Natl Acad. Sci. USA 59:781 1-7815; 
15 Delgrave et al. (1993) Protein Engineering 6(3):327-331). 

The polypeptides of the invention can exhibit post-translational modifications, 
including, but not limited to glycosylations, (e.g., N-linked or O-linked glycosylations), 
myristylations, palmitylations, acetylations and phosphorylations (e.g., serine/threonine or 
tyrosine). In one embodiment, the TANGO 253, TANGO 257, INTERCEPT 258 or 

20 

TANGO 281 polypeptides of the invention exhibit reduced levels of O-linked 
glycosylation and/or N-linked glycosylation relative to endogenously expressed TANGO 
253, TANGO 257, INTERCEPT 258 or TANGO 281 polypeptides of the invention do not 
exhibit O-linked glycosylation or N-linked glycosylation. The post-translational 
modifications of TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281 

25 

polypeptides will vary depending upon the host cell in which in TANGO 253, TANGO 
257, INTERCEPT 258 or TANGO 281 is expressed. Further, post-translational 
modifications of TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281 
polypeptides such as glycosylation can be prevented by treating cells, e.g., with 
tunicamycin. 

30 

An isolated polypeptide of the invention, or a fragment thereof, can be used as an 
immunogen to generate antibodies using standard techniques for polyclonal and 
monoclonal antibody preparation. The full-length polypeptide or protein can be used or, 
alternatively, the invention provides antigenic peptide fragments for use as immunogens. 
35 In one embodiment, an isolated polypeptide or fragment thereof which lacks N- and/or O- 
linked glycosylation is used as an immunogen to generate antibodies using standard 
techniques known to those of skill in the art. The antigenic peptide of a protein of the 
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invention comprises at least 8 (preferably 10, 15, 20, or 30) amino acid residues of the 
amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162 or 164 and encompasses an epitope of the protein 
5 such that an antibody raised against the peptide forms a specific immune complex with the 
protein. 

Preferred epitopes encompassed by the antigenic peptide are regions that are 
located on the surface of the protein, e.g., hydrophilic regions. Figures 2, 4, 10, 12, 19, 21, 
29 and 32, are hydropathy plots of the proteins of the invention. These plots or similar 
10 analyses can be used to identify hydrophilic regions. 

An immunogen typically is used to prepare antibodies by immunizing a suitable 
subject, (e.g., rabbit, goat, mouse or other mammal). An appropriate immunogenic 
preparation can contain, for example, recombinantly expressed or chemically synthesized 
polypeptide. The preparation can further include an adjuvant, such as Freund's complete 
or incomplete adjuvant, or similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains to antibodies directed 
against a polypeptide of the invention. The term "antibody" as used herein refers to 
immunoglobulin molecules and immunologically active portions of immunoglobulin 

20 molecules, i.e., molecules that contain an antigen binding site which specifically binds an 
antigen, such as a polypeptide of the invention e.g., an epitope of a polypeptide of the 
invention. A molecule which specifically binds to a given polypeptide of the invention is 
a molecule which binds the polypeptide, but does not substantially bind other molecules in 
a sample, e.g., a biological sample, which naturally contains the polypeptide. Examples of 

25 immunologically active portions of immunoglobulin molecules include F(ab) and F(ab')2 
fragments which can be generated by treating the antibody with an enzyme such as pepsin. 
The invention provides polyclonal and monoclonal antibodies. The term "monoclonal 
antibody" or "monoclonal antibody composition", as used herein, refers to a population of 
antibody molecules that contain only one species of an antigen binding site capable of 

30 immunoreacting with a particular epitope. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a polypeptide of the invention as an immunogen. Preferred 
polyclonal antibody compositions are ones that have been selected for antibodies directed 
against a polypeptide or polypeptides of the invention. Particularly preferred polyclonal 
35 antibody preparations are ones that contain only antibodies directed against a polypeptide 
or polypeptides of the invention. Particularly preferred immunogen compositions are 
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those that contain no other human proteins such as, for example, immunogen 
compositions made using a non-human host cell for recombinant expression of a 
polypeptide of the invention. In such a manner, the only human epitope or epitopes 
recognized by the resulting antibody compositions raised against this immunogen will be 
5 present as part of a polypeptide or polypeptides of the invention. 

The antibody titer in an immunized subject can be monitored over time by standard 
techniques, such as with an enzyme linked immunosorbent assay (ELISA) using 
immobilized polypeptide. If desired, the antibody molecules can be isolated from the 
mammal (e.g., from the blood) and further purified by well-known techniques, such as 

10 protein A chromatography to obtain the IgG fraction. Alternatively, antibodies specific 
for a protein or polypeptide of the invention can be selected for (e.g., partially purified) or 
purified by, e.g., affinity chromatography. For example, a recombinantly expressed and 
purified (or partially purified) protein of the invention is produced as described herein, and 
covalently or non-covalently coupled to a solid support such as, for example, a 

15 chromatography column. The column can then be used to affinity purify antibodies 
specific for the proteins of the invention from a sample containing antibodies directed 
against a large number of different epitopes, thereby generating a substantially purified 
antibody composition, i.e., one that is substantially free of contaminating antibodies. By a 
substantially purified antibody composition is meant, in this context, that the antibody 

20 sample contains at most only 30% (by dry weight) of contaminating antibodies directed 
against epitopes other than those on the desired protein or polypeptide of the invention, 
and preferably at most 20%, yet more preferably at most 10%, and most preferably at most 
5% (by dry weight) of the sample is contaminating antibodies. A purified antibody 
composition means that at least 99% of the antibodies in the composition are directed 
against the desired protein or polypeptide of the invention. 

At an appropriate time after immunization, e.g., when the specific antibody titers 
are highest, antibody-producing cells can be obtained from the subject and used to prepare 
monoclonal antibodies by standard techniques, such as the hybridoma technique originally 
described by Kohler and Milstein (1975) Nature 256:495-497, the human B cell 
hybridoma technique (Kozbor et al. (1983) Immunol. Today 4:72), the EBV-hybridoma 
technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 
Inc., pp. 77-96) or trioma techniques. The technology for producing hybridomas is well 
known (see generally Current Protocols in Immunology (1994) Coligan et al. (eds.) John 
Wiley & Sons, Inc., New York, NY). Hybridoma cells producing a monoclonal antibody 
of the invention are detected by screening the hybridoma culture supematants for 
antibodies that bind the polypeptide of interest, e.g., using a standard ELISA assay. 
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Alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal 
antibody directed against a polypeptide of the invention can be identified and isolated by 
screening a recombinant combinatorial immunoglobulin library (e.g., an antibody phage 
display library) with the polypeptide of interest. Kits for generating and screening phage 

5 display libraries are commercially available (e.g., the Pharmacia Recombinant Phage 

Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage Display 
Kit, Catalog No. 240612). Additionally, examples of methods and reagents particularly 
amenable for use in generating and screening antibody display library can be found in, for 
example, U.S. Patent No. 5,223,409; PCT Publication No. WO 92/18619; PCT 

10 Publication No. WO 91/17271; PCT Publication No. WO 92/20791; PCT Publication No. 
WO 92/15679; PCT Publication No. WO 93/01288; PCT Publication No. WO 92/01047; 
PCT Publication No. WO 92/09690; PCT Publication No. WO 90/02809; Fuchs et al. 
(1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum. Antibod. Hybridomas 
3:81-85; Huse et al. (1989) Science 246:1275-1281; Griffiths et al. (1993) EMBOJ. 

15 12:725-734. 

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be 
made using standard recombinant DNA techniques, are within the scope of the invention. 
A chimeric antibody is a molecule in which different portions are derived from different 

2 ^ animal species, such as those having a variable region derived from a murine mAb and a 
human immunoglobulin constant region. (See, e.g., Cabilly et al., U.S. Patent No. 
4,816,567; and Boss et al., U.S. Patent No. 4,816397, which are incorporated herein by 
reference in their entirety.) Humanized antibodies are antibody molecules from non- 
human species having one or more complementarily determining regions (CDRs) from 

2$ the non-human species and a framework region from a human immunoglobulin molecule. 
(See, e.g., Queen, U.S. Patent No. 5,585,089, which is incorporated herein by reference in 
its entirety.) Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art, for example using methods described in 
PCT Publication No. WO 87/02671; European Patent Application 184,187; European 

30 Patent Application 1 71 ,496; European Patent Application 173,494; PCT Publication No. 
WO 86/01533; U.S. Patent No. 4,816,567; European Patent Application 125,023; Better et 
al. (1988) Science 240:1041-1043; Liu et al. (1987) Proc. Natl. Acad. ScL USA 
84:3439-3443; Liu et al. (1987) Immunol. 139:3521-3526; Sun et al. (1987) Proc. Natl. 
Acad. Sci. USA 84:214-21 8; Nishimura et al. (1987) Cane. Res. 47:999-1005; Wood et al. 

35 (1985) Nature 314:446-449; and Shaw et al. (1988)7. Natl. Cancer Inst. 80:1553-1559); 
Morrison (1985) Science 229:1202-1207; Oi et al. (1986) Bio/Techniques 4:214; U.S. 



-90- 



WO 00/78808 



PCT/US00/16883 



Patent 5,225,539; Jones et al. (1986) Nature 321 :552-525; Verhoeyan et al. (1988) Science 
239:1534; and Beidler et al. (1988)7. Immunol 141:4053-4060. 

Completely human antibodies are particularly desirable for therapeutic treatment 
of human patients. Such antibodies can be produced, for example, using transgenic mice 

^ which are incapable of expressing endogenous immunoglobulin heavy and light chains 
genes, but which can express human heavy and light chain genes. The transgenic mice are 
immunized in the normal fashion with a selected antigen, e.g., all or a portion of a 
polypeptide of the invention. Monoclonal antibodies directed against the antigen can be 
obtained using conventional hybridoma technology. The human immunoglobulin 

* 0 transgenes harbored by the transgenic mice rearrange during B cell differentiation, and 
subsequently undergo class switching and somatic mutation. Thus, using such a 
technique, it is possible to produce therapeutically useful IgG, IgA and IgE antibodies. 
For an overview of this technology for producing human antibodies, see Lonberg and 
Huszar (1995, Int. Rev. Immunol. 13:65-93). For a detailed discussion of this technology 

1 5 for producing human antibodies and human monoclonal antibodies and protocols for 
producing such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. Patent 5,633,425; U.S. 
Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent 5,545,806. In addition, 
companies such as Abgenix, Inc. (Freemont, CA), can be engaged to provide human 
antibodies directed against a selected antigen using technology similar to that described 

20 above. 

Completely human antibodies which recognize a selected epitope can be generated 
using a technique referred to as "guided selection." In this approach a selected non-human 
monoclonal antibody, e.g., a mouse antibody, is used to guide the selection of a 
completely human antibody recognizing the same epitope. (Jespers et al. (1994) 
Bio/technology 12:899-903). 

An antibody directed against a polypeptide of the invention (e.g., monoclonal 
antibody) can be used to isolate the polypeptide by standard techniques, such as affinity 
chromatography or immunoprecipitation. Moreover, such an antibody can be used to 

3Q detect the protein (e.g., in a cellular lysate or cell supernatant) in order to evaluate the 
abundance and pattern of expression of the polypeptide. The antibodies can also be used 
diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, 
e.g., to, for example, determine the efficacy of a given treatment regimen. Detection can 
be facilitated by coupling the antibody to a detectable substance. Examples of detectable 

35 substances include various enzymes, prosthetic groups, fluorescent materials, luminescent 
materials, bioluminescent materials, and radioactive materials. Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase, beta-galactosidase, or 
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acetylcholinesterase; examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material 
5 includes luminol; examples of bioluminescent materials include luciferase, luciferin, and 
aequorin, and examples of suitable radioactive material include 125j f 131 1, 35s or ^H. 

Further, an antibody (or fragment thereof) can be conjugated to a therapeutic 
moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin or 
cytotoxic agent includes any agent that is detrimental to cells. Examples include taxol, 

10 cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, 
tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, dihydroxy 
anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1-dehydrotestosterone, 
glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and analogs 
or homologs thereof. Therapeutic agents include, but are not limited to, antimetabolites 

15 (e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil 

decarbazine), alkylating agents (e.g., mechlorethamine, thioepa chlorambucil, melphalan, 
carmustine (BSNU) and lomustine (CCNU), cyclothosphamide, busulfan, 
dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine platinum (II) 
(DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and 

2® doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin, 
mithramycin, and anthramycin (AMC)), and anti -mitotic agents (e.g., vincristine and 
vinblastine). 

The conjugates of the invention can be used for modifying a given biological 
^ response, the drug moiety is not to be construed as limited to classical chemical 
therapeutic agents. For example, the drug moiety may be a protein or polypeptide 
possessing a desired biological activity. Such proteins may include, for example, a toxin 
such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as tumor 
necrosis factor, d-interferon, P-interferon, nerve growth factor, platelet derived growth 
^ factor, tissue plasminogen activator; or, biological response modifiers such as, for 
example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), interleukin-6 
("IL-6"), granulocyte macrophase colony stimulating factor ("GM-CSF"), granulocyte 
colony stimulating factor ("G-CSF"), or other growth factors. 

Techniques for conjugating a therapeutic moiety to antibodies are well known, see, 
35 e.g., Arnon et al, "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer 
Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 
243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug Delivery", in 
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Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 (Marcel Dekker, 
Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A 
Review", in Monoclonal Antibodies '84: Biological And Clinical Applications, Pinchera et 
al. (eds.), pp. 475-506 (1985); "Analysis, Results, And Future Prospective Of The 
5 Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in Monoclonal 
Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp. 303-16 
(Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic Properties Of 
Antibody-Toxin Conjugates", Immunol. Rev., 62:119-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an 
10 antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

Accordingly, in one aspect, the invention provides substantially purified antibodies 
or fragment thereof, including human, non-human, chimeric, and humanized antibodies, 
which antibodies or fragments specifically bind to a polypeptide comprising an amino acid 
sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 1 10, 
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162 or 164, or an amino acid sequence encoded by the 
cDNA insert of a clone deposited with the ATCC® as Accession Number 207222, 
Accession Number 207215, Accession Number 20721 7, Accession Number 207221, or 
patent deposit Number PTA-224, or a complement thereof. In another aspect, the 
invention provides substantially purified antibodies or fragments thereof, including 
human, non-human, chimeric and humanized antibodies, which antibodies or fragments 
thereof specifically bind to a polypeptide comprising a fragment of at least 8 contiguous 
amino acid residues, preferably at least 15 contiguous amino acid residues, of the amino 
acid sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 
110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 
146, 148, 150, 152, 154, 156, 158, 160, 162, or 164. 

In another aspect, the invention provides substantially purified antibodies or 
fragments thereof, including human, non-human, chimeric and humanized antibodies, 

30 which antibodies or fragments thereof, which antibodies or fragments thereof specifically 
bind to a polypeptide comprising an amino acid sequence which is at least 95% identical 
to the amino acid sequence of any one of SEQ ED NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, wherein the percent 

35 identity is determined using the ALIGN program of the GCG software package with a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4. In 
another aspect, the invention provides substantially purified antibodies or fragments 
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thereof, including human, non-human, chimeric and humanized antibodies, which 
antibodies or fragments thereof specifically bind to a polypeptide comprising and an 
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to the 
nucleic acid molecule consisting of any one of SEQ ID Nos:l, 2, 8, 9, 15, 16, 21, 22, 26, 

5 27,37,38,46,47,56,57,77,80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 
178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191or 192, orthecDNA 
insert of a clone deposited as ATCC® as Accession Number 207222, Accession Number 

10 20721 5, Accession Number 207217, Accession number 207221 or patent deposit Number 
PTA-224, or a complement thereof, under conditions of hybridization of 6X SSC at 45 °C 
and washing in 0.2 X SSC, 0.1% SDS at 50°C, 55°C, 60°C or 65°C. 

In various embodiments, the substantially purified antibodies or fragments thereof 
of the invention are polyclonal, monoclonal, Fab fragments, single chain antibodies, or 
* 5 F(ab') 2 fragments. The non-human antibodies or fragments thereof of the invention can be 
goat, mouse, sheep, horse, chicken, rabbit or rat antibodies or antibodies fragments. In a 
preferred embodiment, the antibodies of the invention are monoclonal antibodies that 
specifically bind to a polypeptide of the invention. 

The substantially purified antibodies or fragments thereof specifically bind to a 
signal peptide, a secreted sequence, an extracellular domain, a transmembrane or a 
cytoplasmic domain cytoplasmic membrane of a polypeptide of the invention. In a 
particularly preferred embodiment, the substantially purified antibodies or fragments 
thereof of the invention specifically bind to a secreted sequence or an extracellular domain 
of the amino acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 
108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 
144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or the amino acid sequence 
encoded by the EpT253, EpTm253, EpT257, EpTm257, EpT258, EpTm258, EpT281 or 
EpTm281 cDNA insert of ATCC® Accession Number 207222, Accession Number 
207215, Accession Number 207217, Accession Number 207221, or patent deposit 
Number PTA-224, or a complement thereof. In one embodiment, the extracellular domain 
to which the antibody or antibody fragment binds comprises at least 8 contiguous amino 
acid residues, preferably at least 10 or at least 15 contiguous amino acid residues, of 
amino acid residues 30 to 206 of SEQ ID NO:28 (SEQ ID NO:76), amino acid residues 
272 to 370 of SEQ ID NO:28 (SEQ ID NO:34); amino acid residues 30 to 249 of SEQ ID 
NO:39 (SEQ ID NO: 83), amino acid residues 39 to 123 of SEQ ID NO:48 (SEQ ID 
NO:50), or amino acid residues 27 to 1 12 of SEQ ID NO:58 (SEQ ID NO:61). 
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Any of the antibodies of the invention can be conjugated to a therapeutic moiety or 
to a detectable substance. Non-limiting examples of detectable substances that can be 
conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 
fluorescent material, a luminescent material, a bioluminescent material, and a radioactive 
5 material. 

The invention also provides a kit containing an antibody of the invention 
conjugated to a detectable substance, and instructions for use. Still another aspect of the 
invention is a pharmaceutical composition comprising an antibody of the invention and a 
pharmaceutical^ acceptable carrier. In preferred embodiments, the pharmaceutical 
10 composition contains an antibody of the invention, a therapeutic moiety, and a 
pharmaceutically acceptable carrier. 

Still another aspect of the invention is a method of making an antibody that 
specifically recognizes TANGO 253, TANGO 257, INTERCEPT 258, and TANGO 281, 
the method comprising immunizing a mammal with a polypeptide. In one embodiment, 
the polypeptide used as an immunogens comprises an amino acid sequence of any one of 
SEQIDNOs:3, 10, 17, 23, 28, 39, 48,58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 
120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 
156, 158, 160, 162 or 164, or an amino acid sequence encoded by the cDNA insert of a 
clone deposited with the ATCC® as Accession Number 207222, Accession Number 
207215, Accession Number 207217, Accession Number 207221, or patent deposit 
Number PTA-224. In another embodiment, the polypeptide used as an immunogen 
comprises a fragment of at least 15 amino acid residues, preferably at least 25 amino acid 
residues, of the amino acid sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 
58, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 
136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162 or 164, or an amino 
acid sequence which is at least 85%, preferably at least 95% identical to the amino acid 
sequence of any one of SEQ ID NOs:3, 10, 17, 23, 28, 39, 48, 58, 102, 104, 106, 108, 110, 
112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 
148, 150, 152, 154, 156, 158, 160, 162 or 164, wherein the percent identity is determined 
using the ALIGN program of the GCG software package with a PAM120 weight residue 
table, a gap length penalty of 12, and a gap penalty of 4. 
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In another embodiment, the polypeptide used as an immunogen comprises an 
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to the 
35 nucleic acid molecule consisting of any one of SEQ ID NOs:l, 2, 8, 9, 15, 16, 21, 22, 26, 
27,37, 38,46, 47, 56, 57, 77, 80,91, 100, 101, 103, 105, 107, 109, 111, 113, 115, 117, 
119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 
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155, 157, 159, 161, 163, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 
178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191 or 192, orthecDNA 
insert of a clone deposited with the ATCC® as Accession Number 207222, Accession 
Number 207215, Accession Number 207217, Accession Number 207221, or patent 

5 deposit Number PTA-224, or a complement thereof, under conditions of hybridization of 
6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 50°C, 55°C, 60°C or 65°C. 
After immunization, a sample is collected from the mammal that contains an antibody that 
specifically recognizes TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 281, a 
fragment thereof, or allellic variant thereof. Preferably, the polypeptide is recombinantly 

1 0 produced using a non-human host cell. Optionally, the antibodies can be further purified 
from the sample using techniques well known to those of skill in the art. The method can 
further comprise producing a monoclonal antibody- producing cell from the cells of the 
mammal. Optionally, antibodies are collected from the antibody-producing cell. 



III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding a polypeptide of the invention (or a portion thereof). 
As used herein, the term "vector 0 refers to a nucleic acid molecule capable of transporting 

20 another nucleic acid to which it has been linked. One type of vector is a "plasmid", which 
refers to a circular double stranded DNA loop into which additional DNA segments can be 
ligated. Another type of vector is a viral vector, wherein additional DNA segments can be 
ligated into the viral genome. Certain vectors are capable of autonomous replication in a 
host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of 

25 replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 

mammalian vectors) are integrated into the genome of a host cell upon introduction into 
the host cell, and thereby are replicated along with the host genome. Moreover, certain 
vectors, expression vectors, are capable of directing the expression of genes to which they 
are operably linked. In general, expression vectors of utility in recombinant DNA 

30 techniques are often in the form of plasmids (vectors). However, the invention is intended 
to include such other forms of expression vectors, such as viral vectors (e.g., replication 
defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent 
functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
35 invention in a form suitable for expression of the nucleic acid in a host cell. This means 
that the recombinant expression vectors include one or more regulatory sequences, 
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selected on the basis of the host cells to be used for expression, which is operably linked to 
the nucleic acid sequence to be expressed. Within a recombinant expression vector, 
"operably linked" is intended to mean that the nucleotide sequence of interest is linked to 
the regulatory sequence(s) in a manner which allows for expression of the nucleotide 

5 sequence (e.g., in an in vitro transcription/translation system or in a host cell when the 
vector is introduced into the host cell). The term "regulatory sequence" is intended to 
include promoters, enhancers and other expression control elements (e.g., polyadenylation 
signals). Such regulatory sequences are described, for example, in Goeddel, Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA 

10 (1 990). Regulatory sequences include those which direct constitutive expression of a 
nucleotide sequence in many types of host cell and those which direct expression of the 
nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). 
It will be appreciated by those skilled in the art that the design of the expression vector can 
depend on such factors as the choice of the host cell to be transformed, the level of 

15 expression of protein desired, etc. The expression vectors of the invention can be 
introduced into host cells to thereby produce proteins or peptides, including fusion 
proteins or peptides, encoded by nucleic acids as described herein. 

The recombinant expression vectors of the invention can be designed for 
expression of a polypeptide of the invention in prokaryotic {e.g., E. coli ) or eukaryotic 
20 cells (e.g. y insect cells (using baculovirus expression vectors), yeast cells or mammalian 
cells). Suitable host cells are discussed further in Goeddel, supra. Alternatively, the 
recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

^ Expression of proteins in prokaryotes is most often carried out in E. coli with 

vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 2) 
to increase the solubility of the recombinant protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 

^ their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and 
Johnson (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRIT5 
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(Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose E 
binding protein, or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amann et al., (1988) Gene 69:301-315) and pET 1 Id (Studier et aL, Gene Expression 

^ Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 
60-89). Target gene expression from the pTrc vector relies on host RNA polymerase 
transcription from a hybrid trp-lac fusion promoter. Target gene expression from the pET 
lid vector relies on transcription from a T7 gnlO-lac fusion promoter mediated by a 
coexpressed viral RNA polymerase (T7 gnl). This viral polymerase is supplied by host 

1 0 strains BL2 1 (DE3) or HMS 1 74(DE3) from a resident X prophage harboring a T7 gn 1 gene 
under the transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express 
the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, Gene Expression Technology: Methods in Enzymology 
185, Academic Press, San Diego, California (1990) 1 19-128). Another strategy is to alter 
the nucleic acid sequence of the nucleic acid to be inserted into an expression vector so 
that the individual codons for each amino acid are those preferentially utilized in E. coli 
(Wada et al. (1992) Nucleic Acids Res. 20:21 11-2118). Such alteration of nucleic acid 
^ sequences of the invention can be carried out by standard DNA synthesis techniques. 

In another embodiment, the expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerivisae include pYepSecl (Baldari et al. 
(1987) EMBOJ. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), 
pJRY88 (Schultz et al. (1987) Gene 54:1 13-123), pYES2 (Invitrogen Corporation, San 
25 Diego, CA), and pPicZ (Invitrogen Corp, San Diego, CA). 

Alternatively, the expression vector is a baculovirus expression vector. 
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., Sf 9 
cells) include the pAc series (Smith et al. (1983) Mol Cell Biol 3:2156-2165) and the 
pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 329:840) and pMT2PC 
(Kaufrnan et al. (1987) EMBOJ. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. For 

35 

example, commonly used promoters are derived from polyoma, Adenovirus 2, 
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cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook et al., supra. 

In another embodiment, the recombinant mammalian expression vector is capable 
of directing expression of the nucleic acid preferentially in a particular cell type {e.g., 

^ tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. 
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and 

10 Baltimore (1989) EMBOJ. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl Acad. Sci. USA 
86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 

15 4,873,316 and European Application Publication No. 264,166). 

Developmentally-regulated promoters are also encompassed, for example the mouse hox 
promoters (Kessel and Gruss (1990) Science 249:374-379) and the beta-fetoprotein 
promoter (Campes and Tilghman (1989) Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. 
That is, the DNA molecule is operably linked to a regulatory sequence in a manner which 
allows for expression (by transcription of the DNA molecule) of an RNA molecule which 
is antisense to the mRNA encoding a polypeptide of the invention. Regulatory 
sequences operably linked to a nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the antisense RNA molecule in a variety 
of cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be 
chosen which direct constitutive, tissue specific or cell type specific expression of 
antisense RNA. The antisense expression vector can be in the form of a recombinant 
plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under 
the control of a high efficiency regulatory region, the activity of which can be determined 
by the cell type into which the vector is introduced. For a discussion of the regulation of 
gene expression using antisense genes see Weintraub et al. (Reviews - Trends in Genetics, 
Vol. 1(1) 1986). 

35 Another aspect of the invention pertains to host cells into which a recombinant 

expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms 
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refer not only to the particular subject cell but to the progeny or potential progeny of such 
a cell. Because certain modifications may occur in succeeding generations due to either 
mutation or environmental influences, such progeny may not, in fact, be identical to the 
parent cell, but are still included within the scope of the term as used herein. 

5 A host cell can be any prokaryotic (e.g., E. coli) or eukaryotic cell (e.g., insect 

cells, yeast or mammalian cells). 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, 
lipofection, or electroporation. Suitable methods for transforming or transfecting host 
cells can be found in Sambrook, et al. (supra), and other laboratory manuals. 

1 5 For stable transfection of mammalian cells, it is known that, depending upon the 

expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 

20 selectable markers include those which confer resistance to drugs, such as G41 8, 

hygromycin and methotrexate. Cells stably transfected with the introduced nucleic acid 
can be identified by drug selection (e.g., cells that have incorporated the selectable marker 
gene will survive, while the other cells die). 

In another embodiment, the expression characteristics of an endogenous (e.g., 
25 TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 genes) within a cell, cell 
line or microorganism may be modified by inserting a DNA regulatory element 
heterologous to the endogenous gene of interest into the genome of a cell, stable cell line 
or cloned microorganism such that the inserted regulatory element is operatively linked 
with the endogenous gene (e.g., TANGO 253, TANGO 257, INTERCEPT 258 and 
30 TANGO 281 genes) and controls, modulates or activates. For example, endogenous 

TANGO 253, TANGO 257, INTERCEPT 258 and TANGO 281 genes which are normally 
"transcriptionally silent", i.e., a TANGO 253, TANGO 257, INTERCEPT 258 and 
TANGO 281 genes which is normally not expressed, or are expressed only at very low 
levels in a cell line or microorganism, may be activated by inserting a regulatory element 
35 which is capable of promoting the expression of a normally expressed gene product in that 
cell line or microorganism. Alternatively, transcriptionally silent, endogenous TANGO 
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253, TANGO 257, INTERCEPT 258 and TANGO 281 genes may be activated by 
insertion of a promiscuous regulatory element that works across cell types. 

A heterologous regulatory element may be inserted into a stable cell line or cloned 
microorganism, such that it is operatively linked with endogenous TANGO 253, TANGO 
5 257, INTERCEPT 258 and TANGO 28 1 genes, using techniques, such as targeted 
homologous recombination, which are well known to those of skill in the art, and 
described e.g., in Chappel, U.S. Patent No. 5,272,071; PCT publication No. WO 
91/06667, published May 16, 1991. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, 
can be used to produce a polypeptide of the invention. Accordingly, the invention further 
provides methods for producing a polypeptide of the invention using the host cells of the 
invention. In one embodiment, the method comprises culturing the host cell of invention 
(into which a recombinant expression vector encoding a polypeptide of the invention has 
been introduced) in a suitable medium such that the polypeptide is produced. In another 
embodiment, the method further comprises isolating the polypeptide from the medium or 
the host cell. 

The host cells of the invention can also be used to produce nonhuman transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 

20 oocyte or an embryonic stem cell into which a sequence encoding a polypeptide of the 
invention has been introduced. Such host cells can then be used to create non-human 
transgenic animals in which exogenous sequences encoding a polypeptide of the invention 
have been introduced into their genome or homologous recombinant animals in which 
endogenous encoding a polypeptide of the invention sequences have been altered. Such 

25 animals are useful for studying the function and/or activity of the polypeptide and for 
identifying and/or evaluating modulators of polypeptide activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include non-human primates, sheep, 

30 dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA which is 
integrated into the genome of a cell from which a transgenic animal develops and which 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 
used herein, an "homologous recombinant animal" is a non-human animal, preferably a 

35 mammal, more preferably a mouse, in which an endogenous gene has been altered by 
homologous recombination between the endogenous gene and an exogenous DNA 
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molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior 
to development of the animal. 

A transgenic animal of the invention can be created by introducing nucleic acid 
encoding a polypeptide of the invention (or a homologue thereof) into the male pronuclei 

^ of a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the oocyte 
to develop in a pseudopregnant female foster animal. Intronic sequences and 
polyadenylation signals can also be included in the transgene to increase the efficiency of 
expression of the transgene. A tissue-specific regulatory sequence(s) can be operably 
linked to the transgene to direct expression of the polypeptide of the invention to particular 

*0 cells. Methods for generating transgenic animals via embryo manipulation and 

microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. Patent No. 
4,873,191 and in Hogan, Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986) and Wakayama et aL, (1999), Proc. 

15 Natl. Acad. Sci. USA, 96:14984-14989. Similar methods are used for production of other 
transgenic animals. A transgenic founder animal can be identified based upon the 
presence of the transgene in its genome and/or expression of mRNA encoding the 
transgene in tissues or cells of the animals. A transgenic founder animal can then be used 
to breed additional animals carrying the transgene. Moreover, transgenic animals carrying 

20 the transgene can further be bred to other transgenic animals carrying other transgenes. 

To create an homologous recombinant animal, a vector is prepared which contains 
at least a portion of a gene encoding a polypeptide of the invention into which a deletion, 
addition or substitution has been introduced to thereby alter, e.g., functionally disrupt, the 
gene. In a preferred embodiment, the vector is designed such that, upon homologous 
recombination, the endogenous gene is functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" vector). Alternatively, the vector can 
be designed such that, upon homologous recombination, the endogenous gene is mutated 
or otherwise altered but still encodes functional protein (e.g., the upstream regulatory 
region can be altered to thereby alter the expression of the endogenous protein). In the 
homologous recombination vector, the altered portion of the gene is flanked at its 5' and 3* 
ends by additional nucleic acid of the gene to allow for homologous recombination to 
occur between the exogenous gene carried by the vector and an endogenous gene in an 
embryonic stem cell. The additional flanking nucleic acid sequences are of sufficient 
length for successful homologous recombination with the endogenous gene. Typically, 
several kilobases of flanking DNA (both at the 5' and 3* ends) are included in the vector 
(see, e.g., Thomas and Capecchi (1987) Cell 51:503 for a description of homologous 



25 



30 



35 



-102- 



WO 00/78808 



PCT/US00/16883 



recombination vectors). The vector is introduced into an embryonic stem cell line (e.g., by 
electroporation) and cells in which the introduced gene has homologously recombined 
with the endogenous gene are selected (see, e.g., Li et al. (1992) Cell 69:915). The 
selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to form 

5 aggregation chimeras (see, e.g., Bradley in Teratocarcinomas and Embryonic Stem Cells: 
A Practical Approach, Robertson, ed. (IRL, Oxford, 1987) pp. 1 13-152). A chimeric 
embryo can then be implanted into a suitable pseudopregnant female foster animal and the 
embryo brought to term. Progeny harboring the homologously recombined DNA in their 
germ cells can be used to breed animals in which all cells of the animal contain the 

10 homologously recombined DNA by germline transmission of the transgene. Methods for 
constructing homologous recombination vectors and homologous recombinant animals are 
described further in Bradley (1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos. WO 90/1 1354, WO 91/01 140, WO 92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can be produced which 
1 5 contain selected systems which allow for regulated expression of the transgene. One 

example of such a system is the cre/loxP recombinase system of bacteriophage P 1 . For a 
description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc. Natl. 
Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the FLP 
recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 
20 251:1351-1355. If a cre/loxP recombinase system is used to regulate expression of the 
transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
25 recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut et al. (1997) Nature 385:810-813 and PCT 
Publication NOS. WO 97/07668 and WO 97/07669. 

30 

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to herein as 
"active compounds") of the invention can be incorporated into pharmaceutical 
compositions suitable for administration. Such compositions typically comprise the 

35 

nucleic acid molecule, protein, or antibody and a pharmaceutical^ acceptable carrier. As 
used herein the language "pharmaceutically acceptable carrier" is intended to include any 
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and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic 
and absorption delaying agents, and the like, compatible with pharmaceutical 
administration. The use of such media and agents for pharmaceutically active substances 
is well known in the art. Except insofar as any conventional media or agent is 
5 incompatible with the active compound, use thereof in the compositions is contemplated. 
Supplementary active compounds can also be incorporated into the compositions. 

The invention includes methods for preparing pharmaceutical compositions for 
modulating the expression or activity of a polypeptide or nucleic acid of the invention. 
Such methods comprise formulating a pharmaceutically acceptable carrier with an agent 

10 which modulates expression or activity of a polypeptide or nucleic acid of the invention. 
Such compositions can further include additional active agents. Thus, the invention 
further includes methods for preparing a pharmaceutical composition by formulating a 
pharmaceutically acceptable carrier with an agent which modulates expression or activity 
of a polypeptide or nucleic acid of the invention and one or more additional active 

15 compounds. 

A pharmaceutical composition of the invention is formulated to be compatible with 
its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal 
(topical), transmucosal, and rectal administration. Solutions or suspensions used for 
parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial 
agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; buffers such as 
acetates, citrates or phosphates and agents for the adjustment of tonicity such as sodium 
chloride or dextrose. pH can be adjusted with acids or bases, such as hydrochloric acid or 
sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable 
syringes or multiple dose vials made of glass or plastic. 

30 Pharmaceutical compositions suitable for injectable use include sterile aqueous 

solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ 
(BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In all cases, the composition 

3 5 must be sterile and should be fluid to the extent that easy syringability exists. It must be 
stable under the conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. The carrier can be a 
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solvent or dispersion medium containing, for example, water, ethanol, polyol (for 
example, glycerol, propylene glycol, and liquid polyetheylene glycol, and the like), and 
suitable mixtures thereof. The proper fluidity can be maintained, for example, by the use 
of a coating such as lecithin, by the maintenance of the required particle size in the case of 

5 dispersion and by the use of surfactants. Prevention of the action of microorganisms can 
be achieved by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars, polyalcohols such as mannitol, 
sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 

10 compositions can be brought about by including in the composition an agent which delays 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 
(e.g., a polypeptide or antibody) in the required amount in an appropriate solvent with one 
or a combination of ingredients enumerated above, as required, followed by filtered 

^ sterilization. Generally, dispersions are prepared by incorporating the active compound 
into a sterile vehicle which contains a basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the preferred methods of preparation are vacuum 
drying and freeze-drying which yields a powder of the active ingredient plus any 

20 additional desired ingredient from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can 
be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
^ used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 

Pharmaceutically compatible binding agents, and/or adjuvant materials can be 
included as part of the composition. The tablets, pills, capsules, troches and the like can 

30 contain any of the following ingredients, or compounds of a similar nature: a binder such 
as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel, or com starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 

35 methyl salicylate, or orange flavoring. 
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For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from a pressurized container or dispenser which contains a suitable 
propellant, e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
^ transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
*0 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

15 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 

20 polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation 
of such formulations will be apparent to those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal 
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be 

25 prepared according to methods known to those skilled in the art, for example, as described 
in U.S. Patent No. 4,522,811. 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject to 

30 be treated; each unit containing a predetermined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of the invention are dictated by and 
directly dependent on the unique characteristics of the active compound and the particular 
therapeutic effect to be achieved, and the limitations inherent in the art of compounding 

35 such an active compound for the treatment of individuals. 
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For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of body weight 
(generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 50 
mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies and 
fully human antibodies have a longer half-life within the human body than other 
5 antibodies. Accordingly, lower dosages and less frequent administration is often possible. 
Modifications such as lipidation can be used to stabilize antibodies and to enhance uptake 
and tissue penetration (e.g., into the brain). A method for lipidation of antibodies is 
described by Cruikshank et al. ((1997) 7. Acquired Immune Deficiency Syndromes and 
Human Retrovirology 14:193). 

'0 As defined herein, a therapeutically effective amount of protein or polypeptide 

(i.e., an effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably 
about 0.01 to 25 mg/kg body weight, more preferably about 0.1 to 20 mg/kg body weight, 
and even more preferably about 1 to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, 
or 5 to 6 mg/kg body weight. 

15 

The skilled artisan will appreciate that certain factors may influence the dosage 
required to effectively treat a subject, including but not limited to the severity of the 
disease or disorder, previous treatments, the general health and/or age of the subject, and 
other diseases present. Moreover, treatment of a subject with a therapeutically effective 

2^ amount of a protein, polypeptide, or antibody can include a single treatment or, preferably, 
can include a series of treatments. In a preferred example, a subject is treated with 
antibody, protein, or polypeptide in the range of between about 0.1 to 20 mg/kg body 
weight, one time per week for between about 1 to 10 weeks, preferably between 2 to 8 
weeks, more preferably between about 3 to 7 weeks, and even more preferably for about 4, 

^ 5, or 6 weeks. It will also be appreciated that the effective dosage of antibody, protein, or 
polypeptide used for treatment may increase or decrease over the course of a particular 
treatment. Changes in dosage may result and become apparent from the results of 
diagnostic assays as described herein. 

The present invention encompasses agents which modulate expression or activity. 

30 An agent may, for example, be a small molecule. For example, such small molecules 
include, but are not limited to, peptides, peptidomimetics, amino acids, amino acid 
analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, organic 
or inorganic compounds (i.e f . including heteroorganic and organometallic compounds) 
having a molecular weight less than about 10,000 grams per mole, organic or inorganic 

35 compounds having a molecular weight less than about 5,000 grams per mole, organic or 
inorganic compounds having a molecular weight less than about 1,000 grams per mole, 
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organic or inorganic compounds having a molecular weight less than about 500 grams per 
mole, and salts, esters, and other pharmaceutically acceptable forms of such compounds. 

It is understood that appropriate doses of small molecule agents depends upon a 
number of factors within the ken of the ordinarily skilled physician, veterinarian, or 
researcher. The dose(s) of the small molecule will vary, for example, depending upon the 
identity, size, and condition of the subject or sample being treated, further depending upon 
the route by which the composition is to be administered, if applicable, and the effect 
which the practitioner desires the small molecule to have upon the nucleic acid or 
polypeptide of the invention. Exemplary doses include milligram or microgram amounts 
of the small molecule per kilogram of subject or sample weight (e.g., about 1 microgram 
per kilogram to about 500 milligrams per kilogram, about 100 micrograms per kilogram 
to about 5 milligrams per kilogram, or about 1 microgram per kilogram to about 50 
micrograms per kilogram. It is furthermore understood that appropriate doses of a small 
molecule depend upon the potency of the small molecule with respect to the expression or 
15 activity to be modulated. Such appropriate doses may be determined using the assays 
described herein. When one or more of these small molecules is to be administered to an 
animal (e.g., a human) in order to modulate expression or activity of a polypeptide or 
nucleic acid of the invention, a physician, veterinarian, or researcher may, for example, 
prescribe a relatively low dose at first, subsequently increasing the dose until an 
20 appropriate response is obtained. In addition, it is understood that the specific dose level 
for any particular animal subject will depend upon a variety of factors including the 
activity of the specific compound employed, the age, body weight, general health, gender, 
and diet of the subject, the time of administration, the route of administration, the rate of 
excretion, any drug combination, and the degree of expression or activity to be modulated. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (U.S. Patent 5,328,470) or by 
stereotactic injection (see, e.g., Chen et al. (1994) Proc. Natl Acad. ScL USA 
91 :3054-3057). The pharmaceutical preparation of the gene therapy vector can include the 
gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in 
which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 
delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the 
pharmaceutical preparation can include one or more cells which produce the gene delivery 
system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 
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V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues, and antibodies 
described herein can be used in one or more of the following methods: a) screening assays; 
b) detection assays (e.g., chromosomal mapping, tissue typing, forensic biology); c) 

5 predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring clinical trials, 
and pharmacogenomics); and d) methods of treatment (e.g., therapeutic and prophylactic). 
The isolated nucleic acid molecules of the invention can be used to express proteins (e.g., 
via a recombinant expression vector in a host cell in gene therapy applications), to detect 
mRNA (e.g., in a biological sample) or a genetic lesion, and to modulate activity of a 

10 polypeptide of the invention. In addition, the polypeptides of the invention can be used to 
screen drugs or compounds which modulate activity or expression of a polypeptide of the 
invention as well as to treat disorders characterized by insufficient or excessive production 
of a protein of the invention or production of a form of a protein of the invention which 
has decreased or aberrant activity compared to the wild type protein. In addition, the 

* 5 antibodies of the invention can be used to detect and isolate a protein of the and modulate 
activity of a protein of the invention. 

This invention further pertains to novel agents identified by the 
above-described screening assays and uses thereof for treatments as described herein. 

20 

A. Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") 
for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) which bind to polypeptide of the 

25 

invention or have a stimulatory or inhibitory effect on, for example, expression or activity 
of a polypeptide of the invention. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of a 
3 0 polypeptide of the invention or biologically active portion thereof. The test compounds of 
. the present invention can be obtained using any of the numerous approaches in 

combinatorial library methods known in the art, including: biological libraries; spatially 
addressable parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the "one-bead one-compound" library method; and synthetic 
35 library methods using affinity chromatography selection. The biological library approach 
is limited to peptide libraries, while the other four approaches are applicable to peptide, 
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non-peptide oligomer or small molecule libraries of compounds (Lam (1997) Anticancer 
DrugDes. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al. (1993) Proc. Natl Acad. ScL USA 90:6909; Erb et al. 
5 (1994) Proc. Natl Acad. ScL USA 91:1 1422; Zuckermann et al. (1994). J. Med. Chem. 
37:2678; Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem. Int. Ed. 
Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and Gallop et al. 
(1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
Bio/Techniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (U.S. Patent No. 5,223,409), spores (Patent NOS. 
5,571,698; 5,403,484; and 5,223,409), plasmids (Cull et al. (1992) Proc. Natl. Acad Sci. 
USA 89:1865-1869) or phage (Scott and Smith (1990) Science 249:386-390; Devlin 
(1990) Science 249:404-406; Cwirla et al. (1990) Proa Natl. Acad. Sci. USA 
87:6378-6382; and Felici (1991)7. Mol. Biol 222:301-310). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses 
a membrane-bound form of a polypeptide of the invention, or a biologically active portion 
thereof, on the cell surface is contacted with a test compound and the ability of the test 

20 compound to bind to the polypeptide determined. The cell, for example, can be a yeast 
cell or a cell of mammalian origin. Determining the ability of the test compound to bind 
to the polypeptide can be accomplished, for example, by coupling the test compound with 
a radioisotope or enzymatic label such that binding of the test compound to the 
polypeptide or biologically active portion thereof can be determined by detecting the 

25 labeled compound in a complex. For example, test compounds can be labeled with 125^ 
3^S, 14c, or ^H, either directly or indirectly, and the radioisotope detected by direct 
counting of radioemmission or by scintillation counting. Alternatively, test compounds 
can be enzymatically labeled with, for example, horseradish peroxidase, alkaline 
phosphatase, or luciferase, and the enzymatic label detected by determination of 

30 conversion of an appropriate substrate to product. In a preferred embodiment, the assay 
comprises contacting a cell which expresses a membrane-bound form of a polypeptide of 
the invention, or a biologically active portion thereof, on the cell surface with a known 
compound which binds the polypeptide to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability of the test compound to interact 

35 with the polypeptide, wherein determining the ability of the test compound to interact with 
the polypeptide comprises determining the ability of the test compound to preferentially 
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bind to the polypeptide or a biologically active portion thereof as compared to the known 
compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of a polypeptide of the invention, or a biologically 

^ active portion thereof, on the cell surface with a test compound and determining the ability 
of the test compound to modulate (e.g., stimulate or inhibit) the activity of the polypeptide 
or biologically active portion thereof Determining the ability of the test compound to 
modulate the activity of the polypeptide or a biologically active portion thereof can be 
accomplished, for example, by determining the ability of the polypeptide protein to bind to 

1 0 or interact with a target molecule. 

Determining the ability of a polypeptide of the invention to bind to or interact with 
a target molecule can be accomplished by one of the methods described above for 
determining direct binding. As used herein, a "target molecule" is a molecule with which 
a selected polypeptide (e.g., a polypeptide of the invention) binds or interacts with in 
nature, for example, a molecule on the surface of a cell which expresses the selected 
protein, a molecule on the surface of a second cell, a molecule in the extracellular milieu, a 
molecule associated with the internal surface of a cell membrane or a cytoplasmic 
molecule. A target molecule can be a polypeptide of the invention or some other 
polypeptide or protein. For example, a target molecule can be a component of a signal 
transduction pathway which facilitates transduction of an extracellular signal (e.g., a signal 
generated by binding of a compound to a polypeptide of the invention) through the cell 
membrane and into the cell or a second intercellular protein which has catalytic activity or 
a protein which facilitates the association of downstream signaling molecules with a 
polypeptide of the invention. Determining the ability of a polypeptide of the invention to 
bind to or interact with a target molecule can be accomplished by determining the activity 
of the target molecule. For example, the activity of the target molecule can be determined 
by detecting induction of a cellular second messenger of the target (e.g., intracellular 
Ca 2+ , diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the target on an 
appropriate substrate, detecting the induction of a reporter gene (e.g., a regulatory element 
that is responsive to a polypeptide of the invention operably linked to a nucleic acid 
encoding a detectable marker, e.g., luciferase), or detecting a cellular response, for 
example, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the present invention is a cell-free assay 
35 comprising contacting a polypeptide of the invention or biologically active portion thereof 
with a test compound and determining the ability of the test compound to bind to the 
polypeptide or biologically active portion thereof. Binding of the test compound to the 
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polypeptide can be determined either directly or indirectly as described above. In a 
preferred embodiment, the assay includes contacting the polypeptide of the invention or 
biologically active portion thereof with a known compound which binds the polypeptide 
to form an assay mixture, contacting the assay mixture with a test compound, and 
5 determining the ability of the test compound to interact with the polypeptide, wherein 
determining the ability of the test compound to interact with the polypeptide comprises 
determining the ability of the test compound to preferentially bind to the polypeptide or 
biologically active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell-free assay comprising contacting a 
1 0 polypeptide of the invention or biologically active portion thereof with a test compound 
and determining the ability of the test compound to modulate {e.g., stimulate or inhibit) 
the activity of the polypeptide or biologically active portion thereof. Determining the 
ability of the test compound to modulate the activity of the polypeptide can be 
accomplished, for example, by determining the ability of the polypeptide to bind to a 
15 target molecule by one of the methods described above for determining direct binding. In 
an alternative embodiment, determining the ability of the test compound to modulate the 
activity of the polypeptide can be accomplished by determining the ability of the 
polypeptide of the invention to further modulate the target molecule. For example, the 
catalytic/enzymatic activity of the target molecule on an appropriate substrate can be 
20 determined as previously described. 

In yet another embodiment, the cell-free assay comprises contacting a polypeptide 
of the invention or biologically active portion thereof with a known compound which 
binds the polypeptide to form an assay mixture, contacting the assay mixture with a test 
^ compound, and determining the ability of the test compound to interact with the 

polypeptide, wherein determining the ability of the test compound to interact with the 
polypeptide comprises determining the ability of the polypeptide to preferentially bind to 
or modulate the activity of a target molecule. 

The cell-free assays of the present invention are amenable to use of both a soluble 
30 form or the membrane-bound form of a polypeptide of the invention. In the case of 
cell-free assays comprising the membrane-bound form of the polypeptide, it may be 
desirable to utilize a solubilizing agent such that the membrane-bound form of the 
polypeptide is maintained in solution. Examples of such solubilizing agents include 
non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, n-octylmaltoside, 
35 octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton X-100, Triton X-l 14, 
Thesit, Isotridecypoly(ethyIene glycol ether)n, 

3-[(3-cholamidopropyl)dimethylamminio]-l -propane sulfonate (CHAPS), 
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3-[(3^:holamidopropyl)dimethylamminio]-2-hydroxy-l-propane sulfonate (CHAPSO), or 
N-dodecyl=N,N-dimethyl-3-ammonio-l -propane sulfonate. 

In more than one embodiment of the above assay methods of the present invention, 
it may be desirable to immobilize either the polypeptide of the invention or its target 

^ molecule to facilitate separation of complexed from uncomplexed forms of one or both of 
the proteins, as well as to accommodate automation of the assay. Binding of a test 
compound to the polypeptide, or interaction of the polypeptide with a target molecule in 
the presence and absence of a candidate compound, can be accomplished in any vessel 
suitable for containing the reactants. Examples of such vessels include microtitre plates, 

10 test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein can be 
provided which adds a domain that allows one or both of the proteins to be bound to a 
matrix. For example, glutathione-S-transferase fusion proteins or 
glutathione-S-transferase fusion proteins can be adsorbed onto glutathione sepharose 
beads (Sigma Chemical; St. Louis, MO) or glutathione derivatized microtitre plates, which 

^ are then combined with the test compound or the test compound and either the 

non-adsorbed target protein or A polypeptide of the invention, and the mixture incubated 
under conditions conducive to complex formation (e.g., at physiological conditions for salt 
and pH). Following incubation, the beads or microtitre plate wells are washed to remove 
any unbound components and complex formation is measured either directly or indirectly, 

20 for example, as described above. Alternatively, the complexes can be dissociated from the 
matrix, and the level of binding or activity of the polypeptide of the invention can be 
determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either the polypeptide of the invention or 
its target molecule can be immobilized utilizing conjugation of biotin and streptavidin. 
Biotinylated polypeptide of the invention or target molecules can be prepared from 
biotin-NHS (N-hydroxy-succinimide) using techniques well known in the art (e.g., 
biotinylation kit, Pierce Chemicals; Rockford, IL), and immobilized in the wells of 
streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive 
with the polypeptide of the invention or target molecules but which do not interfere with 
binding of the polypeptide of the invention to its target molecule can be derivatized to the 
wells of the plate, and unbound target or polypeptide of the invention trapped in the wells 
by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the polypeptide of the invention or target 
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molecule, as well as enzyme-linked assays which rely on detecting an enzymatic activity 
associated with the polypeptide of the invention or target molecule. 

In another embodiment, modulators of expression of a polypeptide of the invention 
are identified in a method in which a cell is contacted with a candidate compound and the 

5 expression of the selected mRNA or protein ( i.e.. the mRNA or protein corresponding to a 
polypeptide or nucleic acid of the invention) in the cell is determined. The level of 
expression of the selected mRNA or protein in the presence of the candidate compound is 
compared to the level of expression of the selected mRNA or protein in the absence of the 
candidate compound. The candidate compound can then be identified as a modulator of 

10 expression of the polypeptide of the invention based on this comparison. For example, 
when expression of the selected mRNA or protein is greater (statistically significantly 
greater) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as a stimulator of the selected mRNA or protein expression. 
Alternatively, when expression of the selected mRNA or protein is less (statistically 

* 5 significantly less) in the presence of the candidate compound than in its absence, the 
candidate compound is identified as an inhibitor of the selected mRNA or protein 
expression. The level of the selected mRNA or protein expression in the cells can be 
determined by methods described herein. 

In yet another aspect of the invention, a polypeptide of the inventions can be used 
as "bait proteins" in a two-hybrid assay or three hybrid assay {see, e.g., U.S. Patent No. 
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol Chem. 
268:12046-12054; Bartel et al. (1993) Bio/Techniques 14:920-924; Iwabuchi et al. (1993) 
Oncogene 8:1693-1696; and PCT Publication No. WO 94/10300), to identify other 
^ proteins, which bind to or interact with the polypeptide of the invention and modulate 
activity of the polypeptide of the invention. Such binding proteins are also likely to be 
involved in the propagation of signals by the polypeptide of the inventions as, for 
example, upstream or downstream elements of a signaling pathway involving the 
polypeptide of the invention. 

30 This invention further pertains to novel agents identified by the above-described 

screening assays and uses thereof for treatments as described herein. 



B. Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 
corresponding complete gene sequences) can be used in numerous ways as polynucleotide 
reagents. For example, these sequences can be used to: (i) map their respective genes on a 
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chromosome and, thus, locate gene regions associated with genetic disease; (ii) identify an 
individual from a minute biological sample (tissue typing); and (iii) aid in forensic 
identification of a biological sample. These applications are described in the subsections 
below. 

5 

1 . Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. Accordingly, 
10 nucleic acid molecules described herein or fragments thereof, can be used to map the 
location of the corresponding genes on a chromosome. The mapping of the sequences to 
chromosomes is an important first step in correlating these sequences with genes 
associated with disease. 

Briefly, genes can be mapped to chromosomes by preparing PCR primers 
15 (preferably 15-25 bp in length) from the sequence of a gene of the invention. Computer 
analysis of the sequence of a gene of the invention can be used to rapidly select primers 
that do not span more than one exon in the genomic DNA, thus complicating the 
amplification process. These primers can then be used for PCR screening of somatic cell 
hybrids containing individual human chromosomes. Only those hybrids containing the 
20 human gene corresponding to the gene sequences will yield an amplified fragment. For a 
review of this technique, see D'Eustachio et al. ((1983) Science 220:919-924). 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular sequence to a particular chromosome. Three or more sequences can be assigned 
per day using a single thermal cycler. Using the nucleic acid sequences of the invention to 

25 

design oligonucleotide primers, sublocalization can be achieved with panels of fragments 
from specific chromosomes. Other mapping strategies which can similarly be used to map 
a gene to its chromosome include in situ hybridization (described in Fan et al. (1990) 
Proc. Natl Acad. Sci. USA 87:6223-27), pre-screening with labeled flow-sorted 
chromosomes (CITE), and pre-selection by hybridization to chromosome specific cDNA 
^ libraries. Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in one 
step. For a review of this technique, see Verma et al., (Human Chromosomes: A Manual 
of Basic Techniques (Pergamon Press, New York, 1988)). 

35 Reagents for chromosome mapping can be used individually to mark a single 

chromosome or a single site on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. Reagents corresponding to 



- 115- 



WO 00/78808 



PCT/US00/16883 



15 



noncoding regions of the genes actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene families, thus increasing the chance 
of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the physical 
^ position of the sequence on the chromosome can be correlated with genetic map data. 
(Such data are found, for example, in V. McKusick, Mendelian Inheritance in Man, 
available on-line through Johns Hopkins University Welch Medical Library). The 
relationship between genes and disease, mapped to the same chromosomal region, can 
then be identified through linkage analysis (co-inheritance of physically adjacent genes), 
10 described in, e.g., Egeland et al. (1987) Nature 325:783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with a gene of the invention can be determined. If a 
mutation is observed in some or all of the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the causative agent of the particular disease. 
Comparison of affected and unaffected individuals generally involves first looking for 
structural alterations in the chromosomes such as deletions or translocations that are 
visible from chromosome spreads or detectable using PCR based on that DNA sequence. 
Ultimately, complete sequencing of genes from several individuals can be performed to 
confirm the presence of a mutation and to distinguish mutations from polymorphisms. 

Furthermore, the nucleic acid sequences disclosed herein can be used to perform 
searches against "mapping databases", e.g., BLAST-type search, such that the 
chromosome position of the gene is identified by sequence homology or identity with 
known sequence fragments which have been mapped to chromosomes. 

25 In the instant case, the human gene for INTERCEPT 258 has been mapped to the 

long arm of chromosome 1 1, in the region q23. Flanking markers for this region are 
Dl 1S936 and Dl 1S933. The CMT4B (Charcot Marie Tooth neuropathy), ED4 
(ecotodermal dysplasia), JBS (Jacobsen Syndrome), TCPT (thrombocytopenia) loci also 
map to this region of the human chromosome. The APOLP1 (apoplipoprotein cluster), 

30 DRD2 (dopamine receptor), and RDX (radixin) genes also map to this region of the 
human chromosome. This region is syntenic to mouse chromosome 9. The atm (ataxia 
telangiectasia), ruf (rough fur), and vs (variable spotting) loci map to this region of the 
mouse chromosome. The lu (luxoid), vs (variable spotting), atm (ataxia telangiectasia), 
rug (rough fur), and lapl (leucine arylaminopeptidase) genes also map to this region of the 

35 mouse chromosome. 
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A polypeptide and fragments and sequences thereof and antibodies specific thereto 
can be used to map the location of the gene encoding the polypeptide on a chromosome. 
This mapping can be carried out by specifically detecting the presence of the polypeptide 
in members of a panel of somatic cell hybrids between cells of a first species of animal 

5 from which the protein originates and cells from a second species of animal and then 
determining which somatic cell hybrid(s) expresses the polypeptide and noting the 
chromosome(s) from the first species of animal that it contains. For examples of this 
technique, see Pajunen et al. (1988/ Cytogenet. Cell Genet. 47:37-41 and Van Keuren et 
al (1986/ Hum. Genet. 74:34-40. Alternatively, the presence of the polypeptide in the 

10 somatic cell hybrids can be determined by assaying an activity or property of the 

polypeptide, for example, enzymatic activity, as described in Bordelon-Riser et al. (1979^ 
Somatic Cell Genetics 5:597-613 and Owerbach et al. (1978; Proa Natl. Acad. ScL USA 
75:5640-5644. 



2. Tissue Typing 

The nucleic acid sequences of the present invention can also be used to identify 
individuals from minute biological samples. The United States military, for example, is 
considering the use of restriction fragment length polymorphism (RFLP) for identification 

20 of its personnel. In this technique, an individual's genomic DNA is digested with one or 
more restriction enzymes, and probed on a Southern blot to yield unique bands for 
identification. This method does not suffer from the current limitations of "Dog Tags" 
which can be lost, switched, or stolen, making positive identification difficult. The 
sequences of the present invention are useful as additional DNA markers for RFLP 

25 (described in U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can be used to provide an 
alternative technique which determines the actual base-by-base DNA sequence of selected 
portions of an individual's genome. Thus, the nucleic acid sequences described herein can 
be used to prepare two PCR primers from the 5 ! and 3 % ends of the sequences. These 
30 primers can then be used to amplify an individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in this 
manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the present 
invention can be used to obtain such identification sequences from individuals and from 
^ tissue. The nucleic acid sequences of the invention uniquely represent portions of the 
human genome. Allelic variation occurs to some degree in the coding regions of these 
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sequences, and to a greater degree in the noncoding regions. It is estimated that allelic 
variation between individual humans occurs with a frequency at about once per each 500 
bases. Each of the sequences described herein can, to some degree, be used as a standard 
against which DNA from an individual can be compared for identification purposes. 

5 Because greater numbers of polymorphisms occur in the noncoding regions, fewer 

sequences are necessary to differentiate individuals. The noncoding sequences of SEQ ID 
NO:l, 8, 15, 21, 26, 37, 46 or 56, can comfortably provide positive individual 
identification with a panel of perhaps 10 to 1,000 primers which each yield a noncoding 
amplified sequence of 100 bases. If predicted coding sequences, such as those in SEQ ID 

10 NO:2, 9, 16, 22, 27, 38, 47 or 57 are used, a more appropriate number of primers for 
positive individual identification would be 500-2,000. 

If a panel of reagents from the nucleic acid sequences described herein is used to 
generate a unique identification database for an individual, those same reagents can later 
be used to identify tissue from that individual. Using the unique identification database, 
^ positive identification of the individual, living or dead, can be made from extremely small 
tissue samples. 



3. Use of Partial Gene Sequences in Forensic Bioloev 

20 DNA-based identification techniques can also be used in forensic biology. 

Forensic biology is a scientific field employing genetic typing of biological evidence 
found at a crime scene as a means for positively identifying, for example, a perpetrator of 
a crime. To make such an identification, PCR technology can be used to amplify DNA 
sequences taken from very small biological samples such as tissues, e.g., hair or skin, or 

25 body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified sequence 
can then be compared to a standard, thereby allowing identification of the origin of the 
biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can 

30 

enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e. another DNA sequence that is unique to a particular 
individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme generated 

fragments. Sequences targeted to noncoding regions are particularly appropriate for this 

35 • 

use as greater numbers of polymorphisms occur in the noncoding regions, making it easier 

to differentiate individuals using this technique. Examples of polynucleotide reagents 
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include the nucleic acid sequences of the invention or portions thereof, e.g., fragments 
derived from noncoding regions having a length of at least 20 or 30 bases. 

The nucleic acid sequences described herein can further be used to provide 
polynucleotide reagents, e.g., labeled or labelable probes which can be used in, for 
3 example, an in situ hybridization technique, to identify a specific tissue, e.g., brain tissue. 
This can be very useful in cases where a forensic pathologist is presented with a tissue of 
unknown origin. Panels of such probes can be used to identify tissue by species and/or by 
organ type. 

10 

C. Predictive Medicine 

The present invention also pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, and monitoring clinical trials are used for prognostic 
(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one 

15 aspect of the present invention relates to diagnostic assays for determining TANGO 253, 
TANGO 257, INTERCEPT 258, or TANGO 281 protein and/or nucleic acid expression as 
well as TANGO 253, TANGO 257, INTERCEPT 258, or TANGO 281 activity, in the 
context of a biological sample {e.g., blood, serum, cells, tissue) to thereby determine 
whether an individual is afflicted with a disease or disorder, or is at risk of developing a 

20 disorder, associated with aberrant or unwanted TANGO 253, TANGO 257, INTERCEPT 
258, or TANGO 281 expression or activity. The invention also provides for prognostic (or 
predictive) assays for determining whether an individual is at risk of developing a disorder 
associated with TANGO 253, TANGO 257, INTERCEPT 258, or TANGO 281 protein, 
nucleic acid expression or activity. For example, mutations in a TANGO 253, TANGO 

25 257, INTERCEPT 258, or TANGO 281 gene can be assayed in a biological sample. Such 
assays can be used for prognostic or predictive purpose to thereby prophylactically treat an 
individual prior to the onset of a disorder characterized by or associated with TANGO 
253, TANGO 257, INTERCEPT 258, or TANGO 281 protein, nucleic acid expression or 
activity. 

° As an alternative to making determinations based on the absolute expression level 

of selected genes, determinations may be based on the normalized expression levels of 
these genes. Expression levels are normalized by correcting the absolute expression level 
of a TANGO 253, TANGO 257, INTERCEPT 258, or TANGO 281 gene by comparing its 
expression to the expression of a gene that is not a TANGO 253, TANGO 257, 

35 

INTERCEPT 258, or TANGO 281 gene, e.g., a housekeeping gene that is constitutively 
expressed. Suitable genes for normalization include housekeeping genes such as the actin 
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gene. This normalization allows the comparison of the expression level in one sample, 
eg-> a patient sample, to another sample, e.g., a sample from an individual without a 
particular disease or disorder, or a sample from a healthy individual, or between samples 
from different sources. 

5 Alternatively, the expression level can be provided as a relative expression level. 

To determine a relative expression level of a gene, the level of expression of the gene is 
determined for 10 or more samples of different cell isolates (e.g., neural cell isolates, glial 
cell isolates, immune cell isolates, platelet isolates, megakaryocyte isolates, endothelial 
cell isolates, and osteocyte isolates) preferably 50 or more samples, prior to the 

10 determination of the expression level for the sample in question. The mean expression 
level of each of the genes assayed in the larger number of samples is determined and this 
is used as a baseline expression level for the gene(s) in question. The expression level of 
the gene determined for the test sample (absolute level of expression) is then divided by 
the mean expression value obtained for that gene. This provides a relative expression 

1 5 level and aids in identifying extreme cases of diseases and disorders such as obesity, 
coronary disorders (e.g., atherosclerosis), neuronal disorders, pulmonary disorders, renal 
disorders, and bleeding disorders. 

Preferably, the samples used in the baseline determination will be from diseased or 
from non-diseased cells of the appropriate cell type or tissue. The choice of the cell source 
is dependent on the use of the relative expression level. Using expression found in normal 
tissues, as a mean expression score aids in validating whether the TANGO 253, TANGO 
257, INTERCEPT 258, or TANGO 281 gene assayed is specific (versus normal cells). 
Such a use is particularly important in identifying whether a TANGO 253, TANGO 257, 
INTERCEPT 258, or TANGO 281 gene can serve as a target gene. In addition, as more 
data is accumulated, the mean expression value can be revised, providing improved 
relative expression values based on accumulated data. Expression data from cells provides 
a means for grading the severity of the disease or disorder state. 

Another aspect of the invention pertains to monitoring the influence of agents (e.g., 
30 drugs, compounds) on the expression or activity of TANGO 253, TANGO 257, 

INTERCEPT 258, or TANGO 281 in clinical trials. These and other agents are described 
in further detail in the following sections. 



20 
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1. Diagnostic Assays 

35 

An exemplary method for detecting the presence or absence of a polypeptide or 
nucleic acid of the invention in a biological sample involves obtaining a biological sample 
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from a test subject and contacting the biological sample with a compound or an agent 
capable of detecting a polypeptide or nucleic acid (e.g., mRNA, genomic DNA) of the 
invention such that the presence of a polypeptide or nucleic acid of the invention is 
detected in the biological sample. A preferred agent for detecting mRNA or genomic 

5 DNA encoding a polypeptide of the invention is a labeled nucleic acid probe capable of 
hybridizing to mRNA or genomic DNA encoding a polypeptide of the invention. The 
nucleic acid probe can be, for example, a full-length cDNA, such as the nucleic acid of 
SEQ ID NO:l, 2, 8, 9, 15, 16, 21, 22, 26, 27, 37, 38, 46, 47, 56 or 57, or a portion thereof, 
such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and 

10 sufficient to specifically hybridize under stringent conditions to a mRNA or genomic 
DNA encoding a polypeptide of the invention. Other suitable probes for use in the 
diagnostic assays of the invention are described herein. 

A preferred agent for detecting a polypeptide of the invention is an antibody 
capable of binding to a polypeptide of the invention, preferably an antibody with a 

1 ^ detectable label. Antibodies can be polyclonal, or more preferably, monoclonal. An intact 
antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term "labeled", 
with regard to the probe or antibody, is intended to encompass direct labeling of the probe 
or antibody by coupling (i.e., physically linking) a detectable substance to the probe or 
antibody, as well as indirect labeling of the probe or antibody by reactivity with another 

20 reagent that is directly labeled. Examples of indirect labeling include detection of a 

primary antibody using a fluorescently labeled secondary antibody and end-labeling of a 
DNA probe with biotin such that it can be detected with fluorescently labeled streptavidin. 
The term "biological sample" is intended to include tissues, cells and biological fluids 
isolated from a subject, as well as tissues, cells and fluids present within a subject. That 

2$ is, the detection method of the invention can be used to detect mRNA, protein, or genomic 
DNA in a biological sample in vitro as well as in vivo. For example, in vitro techniques 
for detection of mRNA include Northern hybridizations and in situ hybridizations. In 
vitro techniques for detection of a polypeptide of the invention include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 

30 immunofluorescence. In vitro techniques for detection of genomic DNA include Southern 
hybridizations. Furthermore, in vivo techniques for detection of a polypeptide of the 
invention include introducing into a subject a labeled antibody directed against the 
polypeptide. For example, the antibody can be labeled with a radioactive marker whose 
presence and location in a subject can be detected by standard imaging techniques. 

35 

In one embodiment, the biological sample contains protein molecules from the test 
subject. Alternatively, the biological sample can contain mRNA molecules from the test 



- 121 - 



WO 00/78808 



PCT/US00/16883 



subject or genomic DNA molecules from the test subject. A preferred biological sample is 
a peripheral blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent 

^ capable of detecting a polypeptide of the invention or mRNA or genomic DNA encoding a 
polypeptide of the invention, such that the presence of the polypeptide or mRNA or 
genomic DNA encoding the polypeptide is detected in the biological sample, and 
comparing the presence of the polypeptide or mRNA or genomic DNA encoding the 
polypeptide in the control sample with the presence of the polypeptide or mRNA or 

10 genomic DNA encoding the polypeptide in the test sample. 

The invention also encompasses kits for detecting the presence of a polypeptide or 
nucleic acid of the invention in a biological sample (a test sample). Such kits can be used 
to determine if a subject is suffering from or is at increased risk of developing a disorder 
associated with aberrant expression of a polypeptide of the invention, as discussed, for 
example, in sections above relating to uses of the sequences of the invention. 

For example, kits can be used to determine if a subject is suffering from or is at 
increased risk of disorders such as coronary disorders (e.g., heart diseases and disorders 
such as atherosclerosis., coronary artery disease and plaque formation), and adipocyte- 

20 related disorders (e.g., obesity), which are associated with aberrant TANGO 253 

expression. In another example, kits can be used to determine if a subject is suffering 
from or is at increased risk of disorders such as coronary disorders (e.g., heart diseases and 
disorders such as atherosclerosis, coronary artery disease and plague formation), olfactory 
disorders, neurological disorders (e.g., neurodegenerative disorders, neuromuscular 

25 disorders, cognitive disorders, personality disorders, and motor disorder) and pulmonary 
disorders, (e.g., cystic fibrosis), which are associated with aberrant TANGO 257 
expression. In another example, kits can be used to determine if a subject is suffering from 
or is at increased risk of disorders such as Type I immunologic disorders, (e.g., 
anaphylaxis and rhinitis), which are associated with aberrant INTERCEPT 258 expression. 

3Q In another example, kits can be used to determine if a subject is suffering from or is at 
increased risk of disorders such as immunological disorders, (e.g. thrombocytopenia and 
platelet disorders), developmental disorders, coronary disorders, e.g., ischemic heart 
disease or atherosclerosis, neurological disorders, (e.g., head trauma and brain cancer), 
pulmonary disorders, (e.g., lung cancer, cystic fibrosis and rheumatoid lung disease), 

35 kidney disorders, (e.g., glomerulonephritis and end stage renal disease), autoimmune 
disorders, (e.g., Crohn's disease) and embryonic disorders, which are associated with 
aberrant TANGO 281 expression. The kit, for example, can comprise a labeled compound 
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or agent capable of detecting the polypeptide or mRNA encoding the polypeptide in a 
biological sample and means for determining the amount of the polypeptide or mRNA in 
the sample (e.g., an antibody which binds the polypeptide or an oligonucleotide probe 
which binds to DNA or mRNA encoding the polypeptide). Kits can also include 
5 instructions for observing that the tested subject is suffering from or is at risk of 

developing a disorder associated with aberrant expression of the polypeptide if the amount 
of the polypeptide or mRNA encoding the polypeptide is above or below a normal level. 

For antibody-based kits, the kit can comprise, for example: (1) a first antibody 
(e.g., attached to a solid support) which binds to a polypeptide of the invention; and, 
10 optionally, (2) a second, different antibody which binds to either the polypeptide or the 
first antibody and is conjugated to a detectable agent. 

For oligonucleotide-based kits, the kit can comprise, for example: (1) an 
oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic 

^ acid sequence encoding a polypeptide of the invention or (2) a pair of primers useful for 
amplifying a nucleic acid molecule encoding a polypeptide of the invention. The kit can 
also comprise, e.g., a buffering agent, a preservative, or a protein stabilizing agent. The 
kit can also comprise components necessary for detecting the detectable agent (e.g., an 
enzyme or a substrate). The kit can also contain a control sample or a series of control 

^ samples which can be assayed and compared to the test sample contained. Each 

component of the kit is usually enclosed within an individual container and all of the 
various containers are within a single package along with instructions for observing 
whether the tested subject is suffering from or is at risk of developing a disorder 
associated with aberrant expression of the polypeptide. 

25 

2. Prognostic Assays 

The methods described herein can furthermore be utilized as diagnostic or 
prognostic assays to identify subjects having or at risk of developing a disease or disorder 
associated with aberrant expression or activity of a polypeptide of the invention. For 

30 

example, the assays described herein, such as the preceding diagnostic assays or the 
following assays, can be utilized to identify a subject having or at risk of developing a 
disorder associated with aberrant expression or activity of a polypeptide of the invention, 
e.g., coronary disorders, pulmonary disorders, kidney disorders or embryonic disorders. 
Alternatively, the prognostic assays can be utilized to identify a subject having or at risk 

35 

for developing such a disease or disorder. Thus, the present invention provides a method 
in which a test sample is obtained from a subject and a polypeptide or nucleic acid (e.g., 
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15 



mRNA, genomic DNA) of the invention is detected, wherein the presence of the 
polypeptide or nucleic acid is diagnostic for a subject having or at risk of developing a 
disease or disorder associated with aberrant expression or activity of the polypeptide. As 
used herein, a "test sample" refers to a biological sample obtained from a subject of 
interest. For example, a test sample can be a biological fluid (e.g., serum), cell sample, or 
tissue. 

The prognostic assays described herein, for example, can be used to identify a 
subject having or at risk of developing disorders such as disorders discussed, for example, 
in Sections above relating to uses of the sequences of the invention. 

For example, such disorders can include coronary disorders (e.g., heart diseases 
and disorders such as atherosclerosis, coronary artery disease and plague formation) and 
adipocyte disorders (e.g., obesity), which are associated with aberrant TANGO 253 
expression. In another example, prognostic assays described herein, can be used to 
identify a subject having or at risk of developing disorders such as coronary disorders 
(e.g., heart diseases and disorders such as atherosclerosis, coronary artery disease and 
plague formation), olfactory disorders, neurological disorders (e.g., neurodegenerate 
disorders, neuromuscular disorders, cognitive disorders, personality disorders, and motor 
disorders), and pulmonary disorders, (e.g., cystic fibrosis), which are associated with 
aberrant TANGO 257 expression. In another example, prognostic assays described herein, 
can be used to identify a subject having or at risk of developing disorders such as Type I 
immunologic disorders, (e.g., anaphylaxis and rhinitis), which are associated with aberrant 
INTERCEPT 258 expression. In another example, prognostic assays described herein, for 
example, can be used to identify a subject having or at risk of developing disorders such as 
immunological disorders, (e.g. thrombocytopenia and platelet disorders), developmental 
disorders, coronary disorders, (e.g., ischemic heart disease and atherosclerosis), 
neurological disorders, (e.g., head trauma and brain cancer), pulmonary disorders, (e.g., 
lung cancer, cystic fibrosis and rheumatoid lung disease), kidney disorders, (e.g., 
glomerulonephritis and end stage renal disease), autoimmune disorders, (e.g., Crohn's 
disease) and embryonic disorders, which are associated with aberrant TANGO 281 
expression. 

Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) to 
35 treat a disease or disorder associated with aberrant expression or activity of a polypeptide 
of the invention. For example, such methods can be used to determine whether a subject 
can be effectively treated with a specific agent or class of agents (e.g., agents of a type 
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which decrease activity of the polypeptide). Thus, the present invention provides methods 
for determining whether a subject can be effectively treated with an agent for a disorder 
associated with aberrant expression or activity of a polypeptide of the invention in which a 
test sample is obtained and the polypeptide or nucleic acid encoding the polypeptide is 
5 detected (e.g., wherein the presence of the polypeptide or nucleic acid is diagnostic for a 
subject that can be administered the agent to treat a disorder associated with aberrant 
expression or activity of the polypeptide). 

The methods of the invention can also be used to detect genetic lesions or 
mutations in a gene of the invention, thereby determining if a subject with the lesioned 

*0 gene is at risk for a disorder characterized aberrant expression or activity of a polypeptide 
of the invention. In preferred embodiments, the methods include detecting, in a sample of 
cells from the subject, the presence or absence of a genetic lesion or mutation 
characterized by at least one of an alteration affecting the integrity of a gene encoding the 
polypeptide of the invention, or the mis-expression of the gene encoding the polypeptide 

* 5 of the invention. For example, such genetic lesions or mutations can be detected by 

ascertaining the existence of at least one of: 1) a deletion of one or more nucleotides from 
the gene; 2) an addition of one or more nucleotides to the gene; 3) a substitution of one or 
more nucleotides of the gene; 4) a chromosomal rearrangement of the gene; 5) an 
alteration in the level of a messenger RNA transcript of the gene; 6) an aberrant 

20 modification of the gene, such as of the methylation pattern of the genomic DNA; 7) the 
presence of a non-wild type splicing pattern of a messenger RNA transcript of the gene; 8) 
a non-wild type level of a the protein encoded by the gene; 9) an allelic loss of the gene; 
and 10) an inappropriate post-translational modification of the protein encoded by the 
gene. As described herein, there are a large number of assay techniques known in the art 

25 which can be used for detecting lesions in a gene. 

In certain embodiments, detection of the lesion involves the use of a probe/primer 
in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 
4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain 
reaction (LCR) (see, e.g., Landegran et al. (1988) Science 241:1077-1080; and Nakazawa 
et al. (1994) Proc. Natl. Acad. ScL USA 91:360-364), the latter of which can be 
particularly useful for detecting point mutations in a gene (see, e.g., Abravaya et al. (1995) 
Nucleic Acids Res. 23:675-682). This method can include the steps of collecting a sample 
of cells from a patient, isolating nucleic acid (e.g., genomic, mRNA or both) from the cells 
of the sample, contacting the nucleic acid sample with one or more primers which 
specifically hybridize to the selected gene under conditions such that hybridization and 
amplification of the gene (if present) occurs, and detecting the presence or absence of an 
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amplification product, or detecting the size of the amplification product and comparing the 
length to a control sample. It is anticipated that PCR and/or LCR may be desirable to use 
as a preliminary amplification step in conjunction with any of the techniques used for 
detecting mutations described herein. 

5 Alternative amplification methods include: self sustained sequence replication 

(Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86:1 173-1 177), 
Q-Beta Replicase (Lizardi et al. (1988) Bio/Technology 6:1 197), or any other nucleic acid 
amplification method, followed by the detection of the amplified molecules using 

10 techniques well known to those of skill in the art. These detection schemes are especially 
useful for the detection of nucleic acid molecules if such molecules are present in very low 
numbers. 

In an alternative embodiment, mutations in a selected gene from a sample cell can 
be identified by alterations in restriction enzyme cleavage patterns. For example, sample 
and control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel electrophoresis and 
compared. Differences in fragment length sizes between sample and control DNA 
indicates mutations in the sample DNA. Moreover, the use of sequence specific 
^ ribozymes (see, e.g., U.S. Patent No. 5,498,531) can be used to score for the presence of 
specific mutations by development or loss of a ribozyme cleavage site. 

In other embodiments, genetic mutations can be identified by hybridizing a sample 
and control nucleic acids, e.g., DNA or RNA, to high density arrays containing hundreds 
or thousands of oligonucleotides probes (Cronin et al. (1996) Human Mutation 7:244-255; 

25 Kozal et al. (1996) Nature Medicine 2:753-759). For example, genetic mutations can be 
identified in two-dimensional arrays containing light-generated DNA probes as described 
in Cronin et al., supra. Briefly, a first hybridization array of probes can be used to scan 
through long stretches of DNA in a sample and control to identify base changes between 
the sequences by making linear arrays of sequential overlapping probes. This step allows 

3q the identification of point mutations. This step is followed by a second hybridization array 
that allows the characterization of specific mutations by using smaller, specialized probe 
arrays complementary to all variants or mutations detected. Each mutation array is 
composed of parallel probe sets, one complementary to the wild-type gene and the other 
complementary to the mutant gene. 

35 In yet another embodiment, any of a variety of sequencing reactions known in the 

art can be used to directly sequence the selected gene and detect mutations by comparing 
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the sequence of the sample nucleic acids with the corresponding wild-type (control) 
sequence. Examples of sequencing reactions include those based on techniques developed 
by Maxim and Gilbert ((1977) Proc. Natl. Acad. Sci. USA 74:560) or Sanger ((1977) 
Proc. Nati Acad. Sci. USA 74:5463). It is also contemplated that any of a variety of 
5 automated sequencing procedures can be utilized when performing the diagnostic assays 
((1995) Bio/Techniques 19:448), including sequencing by mass spectrometry ( see, e.g., 
PCT Publication No. WO 94/16101 ; Cohen et al. (1996) Adv. Chromatogr. 36:127-162; 
and Griffin et al. (1993) Appl. Biochem. Biotechnol. 38:147-159). 

Other methods for detecting mutations in a selected gene include methods in which 
*0 protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et al. (1985) Science 230:1242). In general, the 
technique of mismatch cleavage entails providing heteroduplexes formed by hybridizing 
(labeled) RNA or DNA containing the wild-type sequence with potentially mutant RNA or 
DNA obtained from a tissue sample. The double-stranded duplexes are treated with an 
1 5 agent which cleaves single-stranded regions of the duplex such as which will exist due to 
basepair mismatches between the control and sample strands. RNA/DNA duplexes can be 
treated with RNase to digest mismatched regions, and DNA/DNA hybrids can be treated 
with SI nuclease to digest mismatched regions. 

In other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated 

20 

with hydroxylamine or osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched regions, the resulting material is 
then separated by size on denaturing polyacrylamide gels to determine the site of 
mutation. See, e.g., Cotton et al. (1988) Proc. Natl. Acad. Sci. USA 85:4397; Saleeba et al. 
^ (1992) Methods Enzymol. 217:286-295. In a preferred embodiment, the control DNA or 
RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations 

30 in cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli 
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells 
cleaves T at G/T mismatches (Hsu et al. (1994) Carcinogenesis 15:1657-1662). 
According to an exemplary embodiment, a probe based on a selected sequence, e.g., a 
wild-type sequence, is hybridized to a cDNA or other DNA product from a test cell(s). 

35 The duplex is treated with a DNA mismatch repair enzyme, and the cleavage products, if 
any, can be detected from electrophoresis protocols or the like. See, e.g., U.S. Patent No. 
5,459,039. 
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In other embodiments, alterations in electrophoretic mobility will be used to 
identify mutations in genes. For example, single strand conformation polymorphism 
(SSCP) may be used to detect differences in electrophoretic mobility between mutant and 
wild type nucleic acids (Orita et al. (1989) Proc. Natl. Acad Sci. USA 86:2766; see also 

5 Cotton (1993) MutaL Res. 285:125-144; Hayashi (1992) Genet. Anal. Tech. Appl. 

9:73-79). Single-stranded DNA fragments of sample and control nucleic acids will be 
denatured and allowed to renature. The secondary structure of single-stranded nucleic 
acids varies according to sequence, and the resulting alteration in electrophoretic mobility 
enables the detection of even a single base change. The DNA fragments may be labeled or 

10 detected with labeled probes. The sensitivity of the assay may be enhanced by using RNA 
(rather than DNA), in which the secondary structure is more sensitive to a change in 
sequence. In a preferred embodiment, the subject method utilizes heteroduplex analysis to 
separate double stranded heteroduplex molecules on the basis of changes in 
electrophoretic mobility (Keen et al. (1991) Trends Genet 7:5). 

1 5 In yet another embodiment, the movement of mutant or wild-type fragments in 

polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al. (1985) Nature 313:495). When DGGE 
is used as the method of analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a 'GC clamp of approximately 40 bp of 

20 high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is 
used in place of a denaturing gradient to identify differences in the mobility of control and 
sample DNA (Rosenbaum and Reissner (1987) Biophys. Chem. 265:12753). 

Examples of other techniques for detecting point mutations include, but are not 
^ limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et al. (1986) Nature 
324:163); Saiki et al. (1989) Proc. Natl. Acad. Sci. USA 86:6230). Such allele specific 
^ oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on selective 
PCR amplification may be used in conjunction with the instant invention. 
35 Oligonucleotides used as primers for specific amplification may cany the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
hybridization) (Gibbs et al. (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3' 
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end of one primer where, under appropriate conditions, mismatch can prevent or reduce 
polymerase extension (Prossner (1993) Tibtech 1 1 :238). In addition, it may be desirable 
to introduce a novel restriction site in the region of the mutation to create cleavage-based 
detection (Gasparini et al. (1992) Mol Cell Probes 6: 1). It is anticipated that in certain 
5 embodiments amplification may also be performed using Taq ligase for amplification 
(Barany (1991) Proc. Natl. Acad. Sci. USA 88:189). In such cases, ligation will occur 
only if there is a perfect match at the 3' end of the 5* sequence making it possible to detect 
the presence of a known mutation at a specific site by looking for the presence or absence 
of amplification. 

10 The methods described herein may be performed, for example, by utilizing 

pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, e.g., in clinical settings to 
diagnose patients exhibiting symptoms or family history of a disease or illness involving a 
gene encoding a polypeptide of the invention. Furthermore, any cell type or tissue, eg., 

15 chondrocytes, in which the polypeptide of the invention is expressed may be utilized in the 
prognostic assays described herein. 



3. Pharmacogenomics 

20 Agents, or modulators which have a stimulatory or inhibitory effect on activity or 

expression of a polypeptide of the invention as identified by a screening assay described 
herein can be administered to individuals to treat (prophylactically or therapeutically) 
disorders associated with aberrant activity of the polypeptide. In conjunction with such 
treatment, the pharmacogenomics (i.e., the study of the relationship between an 

25 individual's genotype and that individual's response to a foreign compound or drug) of the 
individual may be considered. Differences in metabolism of therapeutics can lead to 
severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, the pharmacogenomics of the 
individual permits the selection of effective agents {e.g., drugs) for prophylactic or 

30 therapeutic treatments based on a consideration of the individual's genotype. Such 

pharmacogenomics can further be used to determine appropriate dosages and therapeutic 
regimens. Accordingly, the activity of a polypeptide of the invention, expression of a 
nucleic acid of the invention, or mutation content of a gene of the invention in an 
individual can be determined to thereby select appropriate agent(s) for therapeutic or 

35 prophylactic treatment of the individual. 
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Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected persons. 
See, e.g., Under (1997) Clin. Chem. 43(2):254-266. In general, two types of 
pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a 

5 single factor altering the way drugs act on the body are referred to as "altered drug action." 
Genetic conditions transmitted as single factors altering the way the body acts on drugs are 
referred to as "altered drug metabolism". These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, glucose-6-phosphate 
dehydrogenase deficiency (G6PD) is a common inherited enzymopathy in which the main 

10 clinical complication is haemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes {e.g., N-acetyltransferase 2 (NAT 

15 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated drug 
response and serious toxicity after taking the standard and safe dose of a drug. These 
polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 

20 different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 
experience exaggerated drug response and side effects when they receive standard doses. 
If a metabolite is the active therapeutic moiety, a PM will show no therapeutic response, 

25 as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed 

metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who do 
not respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism 
has been identified to be due to CYP2D6 gene amplification. 

Thus, the activity of a polypeptide of the invention, expression of a nucleic acid 
encoding the polypeptide, or mutation content of a gene encoding the polypeptide in an 
individual can be determined to thereby select appropriate agent(s) for therapeutic or 
prophylactic treatment of the individual. In addition, pharmacogenetic studies can be used 
to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the 
^ identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and 
thus enhance therapeutic or prophylactic efficiency when treating a subject with a 
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modulator of activity or expression of the polypeptide, such as a modulator identified by 
one of the exemplary screening assays described herein. 

4. Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of a polypeptide of the invention (e.g., the ability to modulate aberrant cell 
proliferation chemotaxis, and/or differentiation) can be applied not only in basic drug 
screening, but also in clinical trials. For example, the effectiveness of an agent, as 

]0 determined by a screening assay as described herein, to increase gene expression, protein 
levels or protein activity, can be monitored in clinical trials of subjects exhibiting 
decreased gene expression, protein levels, or protein activity. Alternatively, the 
effectiveness of an agent, as determined by a screening assay, to decrease gene expression, 
protein levels or protein activity, can be monitored in clinical trials of subjects exhibiting 

15 increased gene expression, protein levels, or protein activity. In such clinical trials, 
expression or activity of a polypeptide of the invention and preferably, that of other 
polypeptide that have been implicated in for example, a cellular proliferation disorder, can 
be used as a marker of the immune responsiveness of a particular cell. 

For example, and not by way of limitation, genes, including those of the invention, 
20 that are modulated in cells by treatment with an agent (e.g., compound, drug or small 
molecule) which modulates activity or expression of a polypeptide of the invention (e.g., 
as identified in a screening assay described herein) can be identified. Thus, to study the 
effect of agents on cellular proliferation disorders, for example, in a clinical trial, cells can 
be isolated and RNA prepared and analyzed for the levels of expression of a gene of the 
25 invention and other genes implicated in the disorder. The levels of gene expression (i.e., a 
gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as 
described herein, or alternatively by measuring the amount of protein produced, by one of 
the methods as described herein, or by measuring the levels of activity of a gene of the 
invention or other genes. In this way, the gene expression pattern can serve as a marker, 
30 indicative of the physiological response of the cells to the agent. Accordingly, this 
response state may be determined before, and at various points during, treatment of the 
individual with the agent. 

In a preferred embodiment, the present invention provides a method for monitoring 
the effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate 
identified by the screening assays described herein) comprising the steps of (i) obtaining a 
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pre-administration sample from a subject prior to administration of the agent; (ii) detecting 
the level of the polypeptide or nucleic acid of the invention in the preadministration 
sample; (iii) obtaining one or more post-administration samples from the subject; (iv) 
detecting the level the of the polypeptide or nucleic acid of the invention in the 

5 post-administration samples; (v) comparing the level of the polypeptide or nucleic acid of 
the invention in the pre-administration sample with the level of the polypeptide or nucleic 
acid of the invention in the post-administration sample or samples; and (vi) altering the 
administration of the agent to the subject accordingly. For example, increased 
administration of the agent may be desirable to increase the expression or activity of the 

10 polypeptide to higher levels than detected, i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent may be desirable to decrease 
expression or activity of the polypeptide to lower levels than detected, i.e., to decrease the 
effectiveness of the agent. 



C. Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 
with aberrant expression or activity of a polypeptide of the invention, as discussed, for 
20 example, in sections above relating to uses of the sequences of the invention. 

For example, disorders characterized by aberrant expression or activity of the 
polypeptides of the invention include immunologic disorders, coronary disorders, 
pulmonary disorders, neurological disorders, kidney disorders, and autoimmune disorders. 
The nucleic acids, polypeptides, and modulators thereof of the invention can be used to 

25 treat immunologic diseases and disorders, including but not limited to, allergic disorders 
(e.g., anaphylaxis and allergic asthma) autoimmune and inflammatory disorders (eg., 
atopic dermatitis). Polypeptides of the invention can be used to treat diseases associated 
with bacterial infection (e.g., tuberculosis, e.g., pulmonary tuberculosis), inflammatory 
arthropathy, and bone and cartilage degenerative diseases and disorders (e.g., arthritis, 

30 e.g., rheumatoid arthritis). Polypeptides of the invention can be used to treat pulmonary 
disorders such as lung cancer, cystic fibrosis and rheumatoid lung diseases. Polypeptides 
of the invention can be used to treat coronary disorders, such as ischemic heart disease, 
atherosclerosis and plague formation. Polypeptides of the invention can also be used to 
treat neurological disorders such as neurodegenerate disorders, neuromuscular disorders 

35 and cognitive disorders. Polypeptides of the invention can also be used to treat kidney 
disorders such as glomerulonephritis and end stage renal disease. Further, polypeptides of 
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the invention can be used to treat autoimmune disorders such as Crohns disease, and other 
disorders described herein. 



1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a 
disease or condition associated with an aberrant expression or activity of a polypeptide of 
the invention, by administering to the subject an agent which modulates expression or at 
least one activity of the polypeptide. Subjects at risk for a disease which is caused or 

lO contributed to by aberrant expression or activity of a polypeptide of the invention can be 
identified by, for example, any or a combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent can occur prior to the 
manifestation of symptoms characteristic of the aberrancy, such that a disease or disorder 
is prevented or, alternatively, delayed in its progression. Depending on the type of 

15 aberrancy, for example, an agonist or antagonist agent can be used for treating the subject. 
For example, an antagonist of a TANGO 253, TANGO 257, INTERCEPT 258 or TANGO 
281 proteins may be used to treat an immunologic disorder, e.g., rheumatoid arthritis. The 
appropriate agent can be determined based on screening assays described herein. 

20 

2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating expression or 
activity of a polypeptide of the invention for therapeutic purposes. The modulatory 
method of the invention involves contacting a cell with an agent that modulates one or 

25 more of the activities of the polypeptide. An agent that modulates activity can be an agent 
as described herein, such as a nucleic acid or a protein, a naturally-occurring cognate 
ligand of the polypeptide, a peptide, a peptidomimetic, or other small molecule. In one 
embodiment, the agent stimulates one or more of the biological activities of the 
polypeptide. Examples of such stimulatory agents include the active polypeptide of the 
invention and a nucleic acid molecule encoding the polypeptide of the invention that has 
been introduced into the cell. In another embodiment, the agent inhibits one or more of 
the biological activities of the polypeptide of the invention. Examples of such inhibitory 
agents include antisense nucleic acid molecules and antibodies. These modulatory 
methods can be performed in vitro {e.g., by culturing the cell with the agent) or, 

35 alternatively, in vivo {e.g., by administering the agent to a subject). As such, the present 
invention provides methods of treating an individual afflicted with a disease or disorder 
characterized by aberrant expression or activity of a polypeptide of the invention. In one 
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embodiment, the method involves administering an agent (e.g., an agent identified by a 
screening assay described herein), or combination of agents that modulates (e.g., 
upregulates or downregulates) expression or activity. In another embodiment, the method 
involves administering a polypeptide of the invention or a nucleic acid molecule of the 
5 invention as therapy to compensate for reduced or aberrant expression or activity of the 
polypeptide. 

Stimulation of activity is desirable in situations in which activity or expression is 
abnormally low or downregulated and/or in which increased activity is likely to have a 
beneficial effect. Conversely, inhibition of activity is desirable in situations in which 
' 0 activity or expression is abnormally high or upregulated and/or in which decreased activity 
is likely to have a beneficial effect. 

This invention is further illustrated by the following examples which should not be 
construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are hereby incorporated by reference. 



Deposit of Clones 

Clones containing cDNA molecules encoding human TANGO 253, (clone 
EpT253) human TANGO 257 (EpT257), human INTERCEPT 258 (clone EpT258) and 

20 

human TANGO 281 (clone EpT 281) were deposited with the American Type Culture 
Collection, 10801 University Boulevard, Manassas, VA, 201 10-2209, on April 21, 1999 
as Accession Number 207222, as part of a composite deposit representing a mixture of 
strains, each carrying one recombinant plasmid harboring a particular cDNA clone. 

25 For this composite deposit, to distinguish the strains and isolate a strain harboring 

a particular cDNA clone, an aliquot of the mixture can be streaked out to single colonies 
on nutrient medium (e.g., LB plates) supplemented with lOOg/ml ampicillin, single 
colonies grown, and then plasmid DNA extracted using a standard minipreparation 
procedure. Next, a sample of the DNA minipreparation can be digested with a 

3 q combination of the restriction enzymes Sail, Notl, Xbal and EcorV and the resultant 

products resolved on a 0.8% agarose gel using standard DNA electrophoresis conditions. 
The digest liberates fragments as follows: 

Human TANGO 253 (clone EpT253): 1.3 kb 

35 

Human TANGO 257 (clone EpT257): 1.8 kb 



- 134- 



WO 00/78808 



PCT/USOO/16883 



Human INTERCEPT 258 (clone EpT258): 1 .0 kb and 0.85 kb (human 
INTERCEPT 258 has a EcorV cut site at about bp 1004). 

Human TANGO 281 (clone EpT281): 0.9 kb and 0.9kb (human TANGO 
281 Has an Xbal cut site at about bp 900). 

5 

The identity of the strains can be inferred from the fragments liberated. - 

Clones containing cDNA molecules encoding mouse INTERCEPT 258 were 
deposited with the American Type Culture Collection (Manassas, VA) on April 21, 1999 
1 0 as Accession Number 20722 1 , as part of a composite deposit representing a mixture of 
five strains, each carrying one recombinant plasmid harboring a particular cDNA clone. 

To distinguish the strains and isolate a strain harboring a particular cDNA clone, 
an aliquot of the mixture can be streaked out to single colonies on nutrient medium (e.g., 
LB plates) supplemented with 100^g/ml ampicillin, single colonies grown, and then 
plasmid DNA extracted using a standard minipreparation procedure. Next, a sample of the 
DNA minipreparation can be digested with a combination of the restriction enzymes Sail, 
and Noil, and the resultant products resolved on a 0.8% agarose gel using standard DNA 
electrophoresis conditions. The digest liberates fragments as follows: 



15 



20 



Mouse INTERCEPT 258 (clone EpT258): 1 .8 kb 



The identity of the strains can be inferred from the fragments liberated. 

A clone containing a cDNA molecule encoding mouse TANGO 253 (Clone EpTm 
253) was deposited with American Type Culture Collection, 10801 University Boulevard, 
Manassas, VA 201 10-2209, on April 21, 1999 as Accession Number 207215. 

A clone containing a cDNA molecule encoding mouse TANGO 257 (Clone EpTm 
257) was deposited with American Type Culture Collection, 10801 University Boulevard, 
Manassas, VA 201 10-2209, on April 21, 1999 as Accession Number 207217. 

A clone containing a cDNA molecule encoding mouse TANGO 281 (Clone EpTm 
281) was deposited with American Type Culture Collection, 10801 University Boulevard, 
Manassas, VA 201 10-2209, on June 15, 1999 as patent deposit Number PTA-224. 



- 135- 



WO 00/78808 



PCT/US00/16883 



All publications, patents and patent applications mentioned in this specification are 
herein incorporated by reference into the specification to the same extent as if each 
individual publication, patent or patent application was specifically and individually 
indicated to be incorporated herein by reference. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 
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What is claimed is: 

1. An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:l, 2, 26, 27, 46, 47, the cDNA 
insert of the plasmid deposited with the ATCC® as Accession Number 207222, or a 
complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:l, 2, 15, 16, 26, 27, 46, 47, the cDNA insert of 
the plasmid deposited with the ATCC® as Accession Number 207222, or a complement 
thereof; 

c) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 17, 28, 48, or the amino acid sequence encoded by the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207222; 

d) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:3, 28, 48, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207222, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ED NO: 3, 28, 48, or the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207222; 

e) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:3, 1 7, 28, 48, or the 
amino acid sequence encoded by the cDNA insert of the plasmid deposited with the 
ATCC® as Accession Number 207222, wherein the nucleic acid molecule hybridizes to a 
nucleic acid molecule comprising SEQ ID NO:2, 16, 27, 47, or a complement thereof 
under stringent conditions; 

f) a nucleic acid molecule comprising a nucleotide sequence which is at least 
95% identical to the nucleotide sequence of SEQ ID NO:21, 22, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207217, or a complement 
thereof; 

g) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:21, 22, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 20721 7, or a complement thereof; 
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h) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:23, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207217; 

i) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:23, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207217, wherein the fragment comprises at least 360 contiguous amino acids of 
SEQ ID NO:23, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207217; 

j) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:23, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207217, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ED NO:22, or a complement thereof under stringent conditions; 

k) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ED NO:37, 38, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207221, or a complement 
thereof; 

1) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:37, 38, the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207221 , or a complement thereof; 

m) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:39, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207221 ; 

n) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:39, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207221, wherein the fragment comprises at least 160 contiguous amino acids of 
SEQ ID NO:39, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 20722 1 ; 

o) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:39, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
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Accession Number 207217, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ID NO:38, or a complement thereof under stringent conditions; 

p) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:8, 9, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 20721 5, or a complement 
thereof; 

q) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ED NO: 8, 9, the cDNA insert of the plasmid deposited 
with the ATCC® as Accession Number 207215, or a complement thereof; 

r) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ED NO: 10, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207215; 

s) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 10, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207215, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO: 10, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207215; 

t) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO: 10, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207215, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ED NO:9, or a complement thereof under stringent conditions; 

u) a nucleic acid molecule comprising a nucleotide sequence which is at least 
95% identical to the nucleotide sequence of SEQ ED NO:l 5, 16, the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207222, or a complement 
thereof; 

v) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 17, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207222, wherein the fragment comprises at least 360 contiguous amino acids of 
SEQ ID NO: 17, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as Accession Number 207222. 
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w) a nucleic acid molecule comprising a nucleotide sequence which is at least 
45% identical to the nucleotide sequence of SEQ ID NO:56, 57, the cDNA insert of the 
plasmid deposited with the ATCC® as patent deposit Number PTA-224, or a complement 
thereof; 

x) a nucleic acid molecule comprising a fragment of at least 300 nucleotides 
of the nucleotide sequence of SEQ ID NO:56, 57, the cDNA insert of the plasmid 
deposited with the ATCC® as patent deposit Number PTA-224, or a complement thereof; 

y) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:58, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as patent deposit Number PTA-224; 

z) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ED NO:58, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as patent deposit 
Number PTA-224, wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO: 5 8, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC® as patent deposit Number PTA-224; 

aa) a nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:58, or the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with the ATCC® as patent 
deposit Number PTA-224, wherein the nucleic acid molecule hybridizes to a nucleic acid 
molecule comprising SEQ ID NO:57, or a complement thereof under stringent conditions. 

2. The isolated nucleic acid molecule of claim 1, which is selected from the 
group consisting of: 

a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO: 1,2, 15, 
16, 26, 27, 46, 47, the cDNA insert of the plasmid deposited with the ATCC® as 
Accession Number 207222, or a complement thereof; 

b) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 17, 28, 48, or the amino acid sequence encoded by the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207222; 

c) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:2 1 , 22, 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207217, 
or a complement thereof; 
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d) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:23, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 20721 7; 

e) a nucleic acid comprising the nucleotide sequence of SEQ ED NO:37, 38, 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207221, 
or a complement thereof; 

0 a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ED NO:39, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207221; 

g) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:8, 9, the 
cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207215, or 
a complement thereof; 

h) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO: 10, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207222. 

i) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:56, 57, 
the cDNA insert of the plasmid deposited with the ATCC® as patent deposit Number 
PTA-224, or a complement thereof; 

j) a nucleic acid molecule which encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:58, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as patent deposit Number PTA-224. 

3. The nucleic acid molecule of claim 1 further comprising vector nucleic acid 
sequences. 

4. The nucleic acid molecule of claim 1 further comprising nucleic acid 
sequences encoding a heterologous polypeptide. 

5. A host cell which contains the nucleic acid molecule of claim 1 . 

6. The host cell of claim 5 which is a mammalian host cell 
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7. A non-human mammalian host cell containing the nucleic acid molecule of 
claim 1. 



8. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence of SEQ ID 
NO:3, 10, 17, 23, 28, 39, 48, 58, wherein the fragment comprises at least 15 contiguous 
amino acids of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58; 

b) a naturally occurring allelic variant of a polypeptide comprising the amino 
acid sequence of SEQ ED NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence 
encoded by the cDNA insert of plasmids deposited with the ATCC® as Accession 
Number 207222, Accession Number 207215, Accession Number 207217, Accession 
Number 207221, patent deposit Number PTA-224 wherein the polypeptide is encoded by 
a nucleic acid molecule which hybridizes to a nucleic acid molecule comprising SEQ ED 
NO:2, 9, 16, 22, 27, 38, 47, 57, or a complement thereof under stringent conditions; and 

c) a polypeptide which is encoded by a nucleic acid molecule comprising a 
nucleotide sequence which is at least 45% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NO:2, 9, 27, 38, 47, 57, or at least 98% to a nucleic acid 
comprising the nucleotide sequence of SEQ ED NO:2, 9, 27, 38, 47, 57, or a complement 
thereof. 

9. The isolated polypeptide of claim 8 comprising the amino acid sequence of 
SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58. 

10. The polypeptide of claim 8 further comprising heterologous amino acid 
sequences. 

11. An antibody which selectively binds to a polypeptide of claim 8. 



12. A method for producing a polypeptide selected from the group consisting 

of: 

a) a polypeptide comprising the amino acid sequence of SEQ ID NO:3, 10, 
17, 23, 28, 39, 48, 58, or the amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with the ATCC® as Accession Number 207222, Accession Number 
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207215, Accession Number 207217, Accession Number 207221, or patent deposit 
Number PTA-224; 

b) a polypeptide comprising a fragment of the amino acid sequence of SEQ ID 
NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence encoded by the cDNA insert 
of the plasmid deposited with the ATCC® as Accession Number 207222, Accession 
Number 207215, Accession Number 207217, Accession Number 207221, or patent 
deposit Number PTA-224, wherein the fragment comprises at least 15 contiguous amino 
acids of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with the ATCC® as Accession Number 207222, 
Accession Number 207215, Accession Number 207217, Accession Number 207221 or 
patent deposit Number PTA-224; and 

c) a naturally occurring allelic variant of a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3, 10, 17, 23, 28, 39, 48, 58, or the amino acid sequence 
encoded by the cDNA insert of the plasmid deposited with the ATCC® as Accession 
Number 207222, Accession Number 207215, Accession Number 207217, Accession 
Number 207221, or patent deposit Number PTA-224, wherein the polypeptide is encoded 
by a nucleic acid molecule which hybridizes to a nucleic acid molecule comprising SEQ 
ID NO:l, 8, 15, 21, 26, 37, 46, 56, or a complement thereof under stringent conditions; 

comprising culturing the host cell of claim 5 under conditions in which the nucleic 
acid molecule is expressed. 

13. A method for detecting the presence of a polypeptide of claim 8 in a 
sample, comprising: 

a) contacting the sample with a compound which selectively binds to a 
polypeptide of claim 8; and 

b) determining whether the compound binds to the polypeptide in the sample. 

14. The method of claim 1 3, wherein the compound which binds to the 
polypeptide is an antibody. 

15. A kit comprising a compound which selectively binds to a polypeptide of 
claim 8 and instructions for use. 
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16. A method for detecting the presence of a nucleic acid molecule of claim 1 
in a sample, comprising the steps of: 

a) contacting the sample with a nucleic acid probe or primer which selectively 
hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a nucleic 
acid molecule in the sample. 

17. The method of claim 16, wherein the sample comprises mRNA molecules 
and is contacted with a nucleic acid probe. 

18. A kit comprising a compound which selectively hybridizes to a nucleic acid 
molecule of claim 1 and instructions for use. 

19. A method for identifying a compound which binds to a polypeptide of 
claim 8 comprising the steps of: 

a) contacting a polypeptide, or a cell expressing a polypeptide of claim 8 with 
a test compound; and 

b) determining whether the polypeptide binds to the test compound. 

20. The method of claim 19, wherein the binding of the test compound to the 
polypeptide is detected by a method selected from the group consisting of: 

a) detection of binding by direct detecting of test compound/polypeptide 
binding; 

b) detection of binding using a competition binding assay; 

c) detection of binding using an assay for TANGO 253, TANGO 257, 
INTERCEPT 258, TANGO 281 -mediated signal transduction. 

21. A method for modulating the activity of a polypeptide of claim 8 
comprising contacting a polypeptide or a cell expressing a polypeptide of claim 8 with a 
compound which binds to the polypeptide in a sufficient concentration to modulate the 
activity of the polypeptide. 
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22. A method for identifying a compound which modulates the activity of a 
polypeptide of claim 8, comprising: 

a) contacting a polypeptide of claim 8 with a test compound; and 

b) determining the effect of the test compound on the activity of the 
polypeptide to thereby identify a compound which modulates the activity of the 
polypeptide. 
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GTCGACCCACGCGTCCGGGACTGGGGTGACGGCAC^CAGGGGGCGCCTGGCCG<^AGAAGCGCGGGGGCTGGAGCAC 
CACCAACTGGAGGGTCCGGAGTAGCGAGCGCCCCGAAGGAGGCCATCGGGGAGCCGGGAGGGGGGACTGCGAGAGGACC 



79 
158 



HRPLLVLLLLGL 12 
CCGGCGTCCGGGCTCCCGGTGCCAGCGCT ATG AGG CCA CTC CTC GTC CTG CTG CTC CTG GGC CTG 22 3 

AAGSPPLDDNKIPSLCPGHP 32 
GCG GCC GGC TCG CCC CCA CTG GAC GAC AAC AAG ATC CCC AGC CTC TGC CCG GGG CAC CCC 283 

GLPGTPGHHGSQGLPGRDGR 52 
GGC CTT CCA GGC ACG CCG GGC CAC CAT GGC AGC CAG GGC TTG CCG GGC CGC GAT GGC CGC 34 3 

DGRDGAPGAPGEKGEGGRPG 72 
GAC GGC CGC GAC GGC GCG CCC GGG GCT CCG GGA GAG AAA GGC GAG GGC GGG AGG CG GGA 403 

LPGPRGDPGPRGEAGPAGPT 92 
CTG CCG GGA CCT CGA GGG GAC CCC GGG CCG CGA GGA GAG GCG GGA CCC GCG GGG CCC ACC 4 83 

GPAGECSVPP RSAFSAKRSE112 
GGG CCT GCC GGG GAG TGC TCG GTG CCT CCG CGA TCC GCC TTC AGC GCC AAG CGC TCC GAG 523 

SRVPPPSDAPLPFDRVLVNE132 
AGC CGG GTG CCT CCG CCG TCT GAC GCA CCC TTG CCC TTC GAC CGC GTG CTG GTG AAC GAG S83 

OG HYDAVTGKFTCQVPGVYY1S2 
CAG GGA CAT TAG GAC GCC GTC ACC GGC AAG TTC ACC TGC CAG GTG CCT GGG GTC TAG TAC 643 

FAVHATVYRASLQFDLVKNG172 
TTC GCC GTC CAT GCC ACC GTC TAC CGG GCC AGC CTG CAG TTT GAT CTG GTG AAG AAT GGC 703 

ESIASFFQ FFGGWPKPASL S192 



GAA TCC ATT GCC TCT TTC TTC CAG TTT TTC GGG GGG TGG CCC AAG CCA GCC TCG CTC TCG 763 
rr A MVRLEPEDQVWVQVGVG212 
GGG GGG GCC ATG -GTG AGG CTG GAG CCT GAG GAC CAA GTG TGG GTG CAG GTG GGT GTG GGT 823 

GAC tIc ATT gSc ATC tIt GCC AGC ATC AAG ACA GAC AGC ACC TTC TCC GGA TTT CTG GTG 683 



YSDWHSSPVFA* 244 
TAC TCC GAC TGG CAC AGC TCC CCA GTC TTT GCT TAG 919 

TOXCACrGCAAAGTGAGCTCATGCTCT 953 
GGCCCCCCTXXaAATATTGTGAATGACTAGGGAGGTGGGGTAGAGCAC^ 1077 

FIG. 1A 
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ACnKXXrPGTCTGCGATCAGGTCTGGCA^ 1156 

GGCAAGTGTAAGTCCCCCAGTTGCTCTGGTCCAGGAGCC^ 1235 

ATCCTCCCCACCCCCTCCTGCTCCTGGGGCCGGCCCTTT^ 1314 
AAAAAAAAAAAAAAAGGG CGGCCG C 133 9 



FIG. 1B 
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| i | i | i | i | i | i | . ( i | i | i | i | i | 
1 41 81 121 161 201 241 



■MRPLLVLLLIiGLJVAGSPPL 

APGEKGEGGRPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRVPPPSD 

APLPFDRV^VNEQGHYDAVTGKFTCQV^ IASFFQ 
FFGGWPKPASLSGGAMVRLEPEIXJVVA/QVGVGDYIGIYASIKTDSTFSGFLVYSDWHS 

VFA 



FIG. 2 
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GTCGACCCACGCGTCCGCGCTGTGAAGCCAGCAA^ 7 9 

M R P L L A 6 

CTGCCTACAGACTACAAGAGAGGTTCCTGGAGTCTGAGCCTCCX3GGGTCACCACC ATG AGG CCA CTT CTT GCC 152 

LLLLGLVS 6- SPPLDDNKIPS 26 

CTT CTG CTT CTG GGT CTG GTG TCA GGC TCT CCT CCT CTG GAC GAC AAC AAG ATC CCC AGC 212 

LC PGQPG LPGTPGHHGSQGL 46 

CTG TGT CCC GGG CAG CCC GGC CTT CCA GGC ACA CCA GGT CAC CAT GGC AGC CAA GGC CTG 272 

PGRDGRDGRDGAPGAPGEKG 66 

CCT GGC CGT GAC GGC CGT GAT GGC CGC GAC GGT GCA CCC GGA GCT CCG GGA GAG AAA GGC 332 

EGG RPGL PGPRGEPGPRGEA 86 

GAG GGC GGG AGA CCG GGA CTA CCT GGC CCA CGT GGG GAG CCC GGG CCG CGT GGA GAG GCA 392 

GPMGAIG PAGECSVPPRSAF 106 

GGG CCC ATG GGG GCT ATC GGG CCT GCG GGG GAG TGC TCG GTA CCC CCA CGA TCA GCC TTC 4 52 

SAKRSES RVPPPADTPLPFD 126 

AGT GCC AAG CGA TCC GAG AGC CGG GTA CCT CCG CCA GCC GAC ACA CCC CTA CCT TTC GAC 512 

RVL LNEQ GHYDPTTGK FTCQ 146 

CGT GTG CTG CTA AAT GAG CAG GGC CAT TAC GAC CCC ACT ACT GGC AAG TTC ACC TGC CAA 572 

VPGVYYFAVHATVYRASLQF 166 

GTG CCT GGC GTC TAC TAC TTT GCT GTG CAC GCC ACT GTC TAC CGG GCC AGC TTG CAG TTT 632 

DLVKNGQS IASFFQYFGGWP 186 

SAT CTT GTC AAA AAC GGG CAG TCC ATC GCC TCT TTC TTC CAG TAT TTT GGG GGG TGG CCC 692 

KPASLSGGAMVRLEPEDQVW 206 

\AG CCA GCC TCG CTC TCA GGG GGT GCG ATG GTA AGG CTA GAA CCT GAG GAC CAG GTG TGG 752 

VQ V G VGD Y I 6 Z Y A S . X KTDS T 226 

5TG CAG GTG GGC GTG GGT GAT TAC ATT GGC ATC TAT GCC AGC ATC AAG ACA GAC AGT ACC 812 

FS G F LVY S DWHSS PVFA* 244 

TTC TCT GGA TTT CTC GTC TAT TCT GAC TGG CAC AGC TCC CCA GTC TTC GCT TAA 866 

/IACACAGTK3AACCCGGAGCTGGCACTTGCTCCTCAGT 945 

TGGCCCCCTGGAATATTGTGAATGACTTAGGAAGAG^ 1024 

^GCTGTCTGAGGTCAAGACAGCGTGGAGC^ 1103 

TSGGTCXrTGGCCCAGGACTCCAAGGTGG^ 1182 



FIG. 3A 
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GCTGCTCCCAG<XJCAG<3 CCTTTTTCTCAGAGG TCACTTAATAAACCTAAAATCCTCAAAAAAAAAAAAAAAGGGCGGCC 1261 
GC 1263 



FIG. 3B 



5/61 



WO 00/78808 



PCT/USOO/16883 



I . j . | i | i , . | i | . , i | i | , , . | . , 
1 41 fti i?j iGl 201 241 

>mT253 

MRPLl^LLUSLVSGSPPLDD^IPSLCPGQrc^ 

APGEKGEGGRPGLPGPRGEPGPRGEAGPMGAIGPAGECSVPPRSAJFSAKRSESRVPPPAD 
TPLPFDRVLLNEQGHYDPTTGKFTCQVPGVYYFAVHATV 

YFGGWPKPASLSGGAjMVRLEPEDQWA/QVGVGDYIGIYASIKTDSTFSGFLVYSDWHSSP 
VFA 



FIG. 4 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Kyero and Killer, CABIQS (1989) 

> hT2S3 a. a. 243 aa vs. 

> inT253 a. a. 243 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

93.8% identity; Global alignment score: 123 9 

10 20 30 40 SO 60 70 

inputs MRPIXVT.LiU3LAAGSPPL^DhTKIPSLCPGH 

KRPLLAJLLLLGLVSGSPPLDDNKI PSIXPGQPGLPGTPGHHGSOGLPGRIXJPJXIRIXiAPGAPGEKGEGGR 
10 20 30 40 SO €0 70 



60 90 100 110 120 130 140 

inputs PGLPGPRGDPGPRGEAGPAGPTGPAGECSVTPRSAFSAKRSESRVTPPSDAPLPFTDRVT.VKEQGKYDA^ 

PGLPGPRGEPGPRGEAGPKGAIGPAGECSVPPRSAFSAKRSESRVPPPADTPLPFB 

60 90 100 110 120 130 140 

ISO 160 170 180 190 200 210 

inputs CKfTCQVPGVYYFAVHATVYRASWFDLVK^CESIA^^ 

G KFTCQVPGVYYF AVHATVYRAS LQFOLVXNGQS IASFFQYFGGWPKPASLSGGAKVRLEPEDQVWQVG 
ISO 160 170 180 190 200 210 

220 230 240 

inpu t S VGDYIG I YAS I KTDS TFSG FLVYS DWHS S P VTA 

::::::;:::::::::::::::::::::::::: 
VGDYIG I YAS I KTOSTFSG FLVYS OWHS SPVFA 
220 230 240 



FIG. 5 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Mxller, CABIOS <1989) 

> hT2S3 a.a. aa vs . 

> SwissProt Q15848 - (untitled) , A 
scoring matrix: paml20.mat, gap penalties: -12/-4 
38.7% identity; Global alignment score: 262 

in 20 30 40 50 60 

inputs MRPL-LVLLLLGLAA- - -GSPPLDDNKIPSL- - - . C PG-HPGLPGTPGHHGSQGLPG^RIX3RMAPGA 

10 20 30 40 50 60 /u 

™ 60 90 100 110 120 130 

inputs pgekgec^rpglpgprgdpgprgeagpagp^ 
pgligpkgdiget^ 

80 90 100 110 120 13Q 

inputs EQGHYDAVTCKPTCQVPGVYYPAVHAT^ 

6^^STC^^ipG^^ 
iiS 150 160 170 180 190 200 

210 220 230 240 

inputs EDQVKVQV-GVGDYIGIYASIKTDSTFSGFLVYSDV7HSSPVFA 
: : . : . : : . .::::.:::.: : - - 

GDQVWLQVYGEGERNGLYADNDNDSTFTGFLLY HDT N 

210 220 230 240 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> mT2S3 a. a. 243 aa vs. 

> SwissProt Q15848 - (untitled) 244 aa 
scoring matrix: pami20.mat, gap penalties: -12/-4 

38.3V identity; Global alignment score: 264 

10 20 30 40 50 60 

inputs MR P LLALLLLG LVSGS P PLDDNK I P S L - -CPG-QPGLPGTPGHHGSQGLPGRDGRDGRDGAPGA 

• . :::: : 

I^LIX3AVXLL1JU^PGHDQETTTQ^ 

10 20 30 40 50 60 70 

70 80 90 100 110 120 130 

inputs PGEKGEGGRPGLPGPRGEPGPRGEAGPMGAIGPAGECSVPPRSAFSAKRSESRVPPPADTPLPFDRVLLN 
** * * : : :.::::.: : :»::• s • • j ••«••••• 

PGLIGPKGDIGETGVPGAEGPRGFPGIQGRKGEPGEGAYVYRSA^ 

80 90 100 110 120 130 

140 150 160 170 180 190 200 

inputs EQGHYDPTTGKFTCQVTGvYtfFAVHATVYW 

• " • * • • • • • • • • • « « . « « • . . . » : • • * • !••••••• 

^hnrrrcsTGKraoiipGL 

140 150 160 170 180 190 200 

210 220 230 240 

inputs EIX}VWVQV-GVGDYIGIYASIKTDSTFSGFLVYSDWKSSPVFA 
::::•:: : • « • • : : • .::::.*•:•: • • • • 

GDQVWLQVYGEGERNGLYADNDNDSTFTGFLLY HDT N 

210 220 230 240 



FIG. 6B 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hT253 n.a. 1339 aa vs. 

> adipocyte n.a. (AI417S23) 653 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

29.1% identity; Global alignment score: -1168 

10 20 30 40 50 60 70 

inputs GTCGACCCACGCGTCCGGGACTGGGGTGACGGCAGGGCAGGGGGCGCCTGGCCGGGGAGAAGCGCGGGGG 
: . : .::: :.. :. :::: :- :••: :: - 

TTTTTT GCAT — GTAACTTTTTTATTGA GGCA C AACAAGG C ATTGTAACTTG CCTGG A 

10 20 30 40 50 

80 90 100 110 120 130 140 

inputs CTGGAGCACCACCAACTGGAGGGTCCGGAGTAGCGAGCGCCCCGAAGGAGGCCATCGGGGAGCCGGGAGG 
::::: . : . : ::: •: 

CTTGAG GCAGT CAGTTTAGTAAGCT GAA- CGTTAATACAGTTAA 

60 70 80 90 

150 160 . 170 180 190 200 210 

inputs GGGGACTGCGAGAGGACCCCGGCGTCCGGGCTCCCGGTGCCAGCGCTATGAGGCCACTCCTCGTCCTGCT 

:::: « : - 

GGATTAAG TGCAAACAATATA CATTC ACA 

100 110 120 

220 230 240 250 260 270 280 

inputs GCTCCTGGGCCTGGCGGCCGGCTCGCCCCCACTGGACGACAACAAGATCCCCAGCCTCTGCCCGGGGCAC 



GCX — TGA — CTAGCGA — GGCT ACATCA-CAATTTATAAAG TGCCAGA 

130 140 150 160 170 

290 300 310 320 330 340 350 

inputs CCCGGCCTTCCAGGCACGCCGGGCCACCATGGCAGCCAGGGCTTGCCGGGCCGCGATGGCCGCGACGGCC 



TT — AGT GCTAA TTGTCATTCA — GCTTG ATTTTTCAC 

180 190 200 

360 370 380 390 400 410 420 

inputs GCGACGGCGCGCCCGGGGCTCCGGGAGAGAAAGGOGAGGGCGGGAGGCCGGGACTGCCGGGACCTCGAGG 

::: : • s : . z : z z 

CTCAGGAAGGAAAA — CAAAAAAGTAAGG ACC TCCTC 

210 220 230 

430 440 450 460 470 480 490 

inputs GGACCCCGGGCCGOGAGGAGAGGCGGGACCCGCGGGGCCCACCGGGCCTGCCGGGGA 
n z : x : : . 

CCTCTAGGAA 

240 

500 510 520 530 540 55*6 560 

inputs CrcCGATCCX;CCTTCAGCGCK^GC^ 



FIG. 7 A 
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CAAAAAACATTTTCCT AAACCAA 

250 260 270 

570 580 590 600 610 620 630 

inpu t s TCG ACCGCGTGCTGGTGAACG AGCAGGGACATTACGACGCCGTCACCGGCAAGTTCACCTGCCAGGTGCC 
: ::: . :: ::: .:: ::. :...: 

TCAGTC ATGA — GGGCAAAGAC — TACTTTTCCTTCA ATC-CCA — CTAAT 

280 290 300 . 310 

640 650 660 670 680 690 700 

inputs TGGGGTCTACTACTTCGCCGTCCATGCCACCGTCTACCGGGCCAGCCTGCAGTTTGATCTGGTGAAGAAT 



TAGAA CACCATCC TTTTAT T 

320 330 

710 720 730 740 750 760 770 

inputs GGCGAATCCATTGCCTCTTTCTTCCAGTTTTTCGGGGGGTGGCCCAAGCCAGCCTCGCTCTCGGGGGGGG 



GTCAATACTGT ACTGACTTTCAAT CTTG 

340 350 360 

780 790 800 810 820 830 840 

inputs CCATGGTGAGGCTGGAGCCTGAGGACCAAGTGTGGGTGCAGGTGGGTGTGGGTGACTACATTGGCATCTA 



— ATAAAGAAGAT-- AGCCTGAAAAC GTAGAATAT 

370 380 390 

850 860 870 880 890 900 910 

inputs TGCCAGCATCAAGACAGACAGCACCTTCTCCGGATTTCTGGTGTACTCCGACTGGCACAGCTCCCCAGTC 
: 

TTCCAGCTACT TCCATAAAT TGCTCCCCTGT- 

400 410 420 

920 930 940 950 960 970 980 

inpu t s TTTGCTTAGTGCCCACTGCAAAGTG AGCTCATGCTCTCACTCCTAGAAGGAGGGTGTGAGGCTGACAACC 



GCAGACGT — 

430 

990 1000 1010 1020 1030 1040 1050 

inputs TGGTCATCCAGGAGGGCTGGCCCCCCTGGAATATTGTGAATGACT 

:♦:::.. :::: : :::::::: :::;:..:.:•::: : 

-r AACCATAT CTGGTCTCCCTGGAA GAGCTGAAGAATTGCATGAT — 

440 450 460 470 

1060 1070 1080 1090 1100 1110 1120 

inputs GTCCTGCTGCTGGCAAGGAATGGGAAC^GTGGCTC 

si::*::: :••::::: ::::::: t : :::::::::•:::: 

TGCTAGCA GTTTCA-TGG TCTG-GAGCA C CATCATTGG-CATAGGCT 

480 490 500 510 520 

1130 1140 1150 1160 1170 1180 1190 
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input 8 GG ATTTCTGCCCAAG ACCAG AGG AGTGTGCTGTGCTGGCAAGTGTAAGTCCCCCAGTTGCTCTTC 

:: : „ «. :::: - : .: : : : 

GATA CCAAGACCT CTT — CATTCTTCANTGAG GTTG-AC — ATACAG 

530 540 550 560 

1200 1210 1220 1230 1240 1250 1260 

inputs GAGCCCACGGTGGGGTGCTCTCTTCCTGGTCCTCTGCTTCTCTGGATCCTCCCCACCCCCTCCTGCTCCT 
**••• •••*••••••••■ • 

TGGCACAT TCACTGCCAG — CTTTTACATGTGAAAAA TGAAAAACGT 

570 580 590 600 

1270 1280 1290 1300 1310 1320 1330 

inpu t S GGGGCCGGCCCTTTTCTCAG AG ATCACTCAATAAACCTAAG AACCCTCCAAAAAAAAAAAAAAAAAAAAG 



AGTGCCA TTCACTTGG — CA ATTAAATCTA CCAAAGCTGAGATCAAA- 

610 620 630 640 650 



inputs GGCGGCCGC 



FIG. 7C 
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ALIGN calculates a global alignment of two sequences 
version 2-OuPlease cite: Myers and Miller, CABIOS (1989) 

> mT253 n.a. 1263 aa vs. 

> adipocyte n.a. (AI417523) (653 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

30.4% identity; Global alignment score: -840 

10 20 30 40 50 . 60 70 

inputs GTCGACCCACGCGTCCGCGCTGTGAAGCCAGCAAGGAGCAACCAGAAGCTAGGAGTCAGTCAGCAAGGAC 

m • • ...... .«•••••••«».» 

* ........*•. .. ....... .......a* 

TT TTT TGCATGTAACTT TTTTATTGAGGCA — CAACAAGG-C 

10 20 30 

80 90 100 110 120 130 140 

inputs AGGGGCTGCCTGCCTACAGACTACAAGAGAGGTTCCTGGAGTCTGAGCCTCCGGGGTCACCACCATGAGG 



ATTG TAACT TGCCTGGA CTTGAGG 

40 50 60 

150 160 170 180 190 200 210 

inputs CCACTTCTTGCCCTTCTGCTTCTGGGTCTGGTGTCAGGCTCTCCTCCTCTGGACGACAACAAGATCCCCA 
: . ::.::: .::.:: 

CAG TCAGTTT AGTAAG CTGAACGTTAATA 

70 80 90 

220 230 240 250 260 270 280 

inputs GCCTGTGTCCCGGGCAGCCCGGCCTTCCAGGCACACCAGGTCACCATGGCAGCCAAGGCCTGCCTGGCCG 



— CAGTTA — AGGA TTAAGTGCAAACAATAT AC ATTCAC AG CTTG ACT AGC-G 

100 110 120 130 140 

290 300 310 320 330 340 350 

inputs TGACGGCCGTGATGGCCGCGACGGTGCACCCGGAGCTCCGGGAGAGAAAGGCGAGGGCGGGAGACCGGGA 



AGGCTAC ATCACAATTTATAAAGTGC CAG ATT A GTG 

.150 160 170 

360 370 380 390 400 410 420 

inputs CTACCTGGCCC^CGTGGGGAGCCaXXSCCGCXJTGGA 

• ••••• • • • • • • ■ . . * ; » . J . « 

... •«• ... ••••••• ... ..... ••••• 

CTAATTGTCATTCA GCTTGATTTTTCA CCTCAGGAA — GGAAAACAA 

180 190 200 210 220 

430 440 450 460 470 480 490 

inputs GGGAGTGCTCGGTACCCCCACGATGAGCCTTCAGTGCCAAGCGATCCGAGAGCCGGGTACCT^ 
...:::. .: ::: ::•: iti :.:: ::.: .:*:.: • : • : : : 

AAAAGTA AGGACCTCCTC CCT CT AG -G AACAAAAAAC- ATTTTCCTA 

230 240 250 260 

500 510 520 530 540 550 560 

inputs CGACAC^CCCCTACCrTTCGACCXSTGTGCrGCTAAATGAGCAGGGCCATTACGACCCCACT^ 

.:::..::.. :.::.: ::.::.. . . : . : • : ::::::: : •** 
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AACCAATCAGTCATGAG-GGCAAAGACTACTTTTCCTT — CAATCCCACTAAT TAG 

270 280 290 300 310 

570 580 590 600 6X0 620 630 

inputs TTCACCTGCCAAGTGCCTGGTCTCTACrACTTTC 

• •«• • • • • • • •«••• •••••• 

AACACCATCCTTTTA — TTG TCAATACTGT ACTGACTT 

320 .330 340 350 

640 650 660 670 680 690 700 

inputs TTGATCTTGTCAAAAACGGGCAGTCCATCGCCTCTTTCTTCCAGTATTTTGGGGGGTGGCCCAAGCCAGC 



-TCAATCTT GATAAAGAAGATAGCC- 

360 370 



710 720 730 740 750 760 770 

inpu t s CTCGCTCTCAGGGGGTGCGATGGTAAGGCTAGAACCTGAGGACCAGGTGTGGGTGCAGGTGGGCGTGGGT 



— TGAAAACGTAGAA TATTTCCAG C TAC 

380 390 400 



780 790 800 810 820 830 840 

inputs GATTACATTGGCATCTATGCCAGCATCAAGACAGACAGTACCTTCTCTGGATTTCTCGTCTATTCTGACT 

• • • ■ • • • • * • • •••••«••• « « • • • • • 

• • • ■•»•« • mm • * • • • • •«•■•■ • • ■ • a •••• • mm 

— TTCCATAAATTGCT CC — C — CTGTGCAGACGTAACCATATCTGG TCTC — C CT 

410 420 430 440 450 



850 860 870 880 890 900 910 

input s GGCACAGCTCCCCAGTCTTCGCTTAAAACACAGTGAACCCGGAGCTGGCACTTGCTCCTCAGTGGAGGGT 

■ m m •••• 

• «•••«• ••>•••*•••• 

GGAAGAGCTGA — AGAATT-GCATGATT GCTAGCAGTTTC : ATGGT 

460 470 480 490 

920 930. 940 950 960 970 980 

input s GTGACACTAACCCGCGCAGCGCATACCAGGAGGGCTGGCCCCCTGGAATATTGTGAATGACTTAGGAAGA 



■CTGGA GCACC ATCATTGGCATAGGCTGA 

500 510 520 



990 1000 1010 1020 1030 1040 1050 

inpu t s GAGGGAGCCACTTCC^GTCCCACTGCTGGCAATGAATGGAGACAGGCT 

: . : : . :::::: 

TACCAAGACCTCTTCATTCTT CAN TGAGGT — TGACA 

530 540 550 

1060 1070 1080 1090 1100 1110 1120 

Inputs AGCAGTGGCTGGGTTTCTGCCCAGGACrTTAGAA 

••::tts::. . : .:x:x: ••ssi: s«n st. t..t 

TACAGTGGCACATTCACTGCC AGCTTT TACA TGTGAAAAATGA AAAA 

560 570 580 590 600 

1130 1140 1150 1160 1170 1180 1190 
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inputs CTCCAAGGTGGGATGCTCCATTCCTAGTCCTGTGTCCCCTCTAGGTCCCTGACTCCATCTCTGCTGCTCC 
: :::::: : 55 

C GTAGTG CCATTC ACTTGG CAAT TAAATCTAC 

610 620 630 

1200 1210 1220 1230 1240 1250 1260 

inpu t S CAGGGCAGGCCTTTTTCTCAGAGGTCACTTAATAAACCTAAAATCCTCAAAAAAAAAAAAAAAGGGCGGC 



CAAAGCTG AG A TCAAA- 

640 650 



inputs CGC 
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GTCGACCCACXSCGTCCGCGGACGCGTGG^ 7 9 

MGPSTPLLILFLLSWSG 17 

AGGCTGCC ATG GGG CCC AGC ACC CCT CTC CTC ATC TTG TTC CTT TTG TCA TGG TCG GGA 138 

PLQGQQHHLVEYMERRLAAL 37 

CCC CTC CAA GGA CAG CAG CAC CAC CTT GTG GAG TAC ATG GAA CGC CGA CTA GCT GCT TTA 198 

EERLAQCQDQSSRHAAELRD 57 

GAG GAA CGG CTG GCC CAG TGC CAG GAC CAG AGT AGT CGG CAT GCT GCT GAG CTG CGG GAC 258 

FKNKMLPLLEVAEKEREALR 77 

TTC AAG AAC AAG ATG CTG CCA CTG CTG GAG GTG GGA GAG AAG GAG CGG GAG GCA CTC AGA 318 

TEADTISGRVDRLEREVDYL 97 

ACT GAG GCC GAC ACC ATC TCC GGG AGA GTG GAT CGT CTG GAG CGG GAG GTA GAC TAT CTG 378 

ETQNPA L P C/VE FDEKVTG G P 117 

GAG ACC CAG AAC CCA GCT CTG CCC TGT J GTA GAG TTT GAT GAG AAG GTG ACT GGA GGC CCT 438 

GTKGKGR RN^KYDMVTDCGY 137 

GGG ACC AAA GGC AAG GGA AGA AGG AAT GAG AAG TAC GAT ATG GTG ACA GAC TGT GGC TAC 498 

TISQVR SMKILKRF GGPAGL 157 

ACA ATC TCT CAA GTG AGA TCA ATG AAG ATT CTG AAG CGA TTT GGT GGC CCA GCT GGT CTA 558 

WTKDPLGQTEKIYVLDGTQN 177 

TGG ACC AAG GAT CCA CTG GGG CAA ACA GAG AAG ATC TAC GTG TTA GAT GGG ACA CAG AAT 618 

DTAFVF P RLRDFTLAMAAR K 197 

GAC ACA GCC TTT GTC TTC CCA AGG CTG CGT GAC TTC ACC CTT GCC ATG GCT GCC CGG AAA 678 

ASRVRVPFPWVGTGQLVYGG 217 

GCT TCC CGA GTC CGG GTG CCC TTC CCC TGG GTA GGC ACA GGG CAG CTG GTA TAT GGT GGC 738 

FI*Y FAR R P PGR P GGG G EM E K 237 

TTT CTT TAT TTT GCT CGG AGG CCT CCT GGA AGA CCT GGT GGA GGT GGT GAG ATG GAG AAC 798 

T I* Q L I K F HIjANRTVVD S S V F 257 

ACT TTG "CAG CTA ATC AAA TTC CAC CTG GCA AAC CGA ACA GTG GTG GAC AGC TCA GTA TTC 858 

PAEGLI PPYGLTADTYIDLA 277 

CCA GCA GAG GGG CTG ATC CCC CCC TAC GGC TTG ACA GCA GAC ACC TAC ATC GAC CTG GCA 918 

A D E E GL WAV YATREDDRH I* C 297 

GCT GAT GAG GAA GGT CTT TGG GCT GTC TAT GCC ACC CGG GAG GAT GAC AGG CAC TTG TGT 978 



DTEQQ WDTPCP 317 
* 
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CTG GCC AAG TTA GAT CCA CAG ACA CTG GAC ACA GAG CAG CAG TGG GAC ACA CCA TGT CCC 1036 

RENAEAAFVICGTLY VVYN_T 337 

AGA GAG AAT GCT GAG GCT GCC TTT GTC ATC TGT GGG ACC CTC TAT GTC GTC TAT AAC ACC 1098 

RPASRARIQCSFDASGTLTP 357 

CGT CCT GCC AGT CGG GCC CGC ATC CAG TGC TCC TTT GAT GCC AGC GGC ACC CTG ACC CCT 11S8 

ERAALPY FPRRYGAHASLRY 377 

GAA CGG GCA GCA CTC CCT TAT TTT CCC CGC AGA TAT GGT GCC CAT GCC AGC CTC CGC TAT 1218 

NPRERQL YAWDDGYQI VYKL 397 

AAC CCC CGA GAA CGC CAG CTC TAT GCC TGG GAT GAT GGC TAC CAG ATT GTC TAT AAG CTG 12 78 

EMRKKEE E V * 407 
GAG ATG AGG AAG AAA GAG GAG GAG GTT TGA 1308 

GGAGCTAGCCTTGTTTTTTGCATCTTTCT 1387 

CTTO^TGTGGGCCAGTTGTGGCTCAAATCCTCT AT ATTTTTAGCCAATGG 1466 

TCATACGGAACTCCAGATCCTG AGTAATCCTTTTAGAGCCCGAAGAGTCAAAACCCTCAATGTTCCCTCCrGCrCTC 1545 

GCCCCATGTCAAGAAATTTCAGGCTAATCATGCCCGAGACCC^^ 1624 

AGGCAGCAGTGTTCTTCCCCTCAGAGTG ACITGGGGAGGGAGAAATAGGAGGAGAaSTCCAGCTCTGTCCTCTCTTCCT 1703 

CACTCCTCCCTTCAGTGTCCTGAGGAACAGGACTTTCTCCACATTC 1782 
ATCCACTGCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 1831 
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MGPSTPLLILFLLSWSGPI^GQQHHLVEYMERRLAALEERIAQCQI>2SSRHAAELRDFKN 

KMLPLLEVAEKEREALRTEADTISG 

GKGRRNEKYTDMVTDCGYTISQVRSMKILKRFGGPAG 

FWPRLRDFTLAMAARKASRVRVPF 

LI KFHLANRT WDS S VFP AEGL I P P YG LTADTY I DLAADEEGLWAVYATREDDRHLC LAK 
LDPQTLDTEQQWDTPCPRENAEAAFVICGTLYWYNTRPASRARIQCSFDASGTLTPERA 
AL P YF PRRYG AHAS LRYN P RERQ L YAWDDG YQ I VYKLEMRKKE E EV 
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MGPSAPLLLLFF 12 

GTCGACCCACGCGTCCGACTTAAGGCTGCC ATG GGG CCC AGT GOT CCT CTG CTG CTC CTC TTC TTT 66 

LS WTGPLQGQQHHLVEYMER 32 

TTG TCA TGG ACG GGA CCC CTT CAG GGA CAG CAG CAC CAC CTT GTG GAG TAC ATG GAA CGC 126 

R LAA L E E R L A Q C Q D Q ♦ S SRHA 52 

CGA CTA GCT GCC TTA GAG GAA CGG CTG GCC CAA TGC CAG GAT CAG AGT AGT CGG CAT GCT 166 

AELRDFKNKMLPLLEVAEKE 72 

GCC GAG CTT CGG GAC TTC AAA AAC AAG ATG TTG CCT CTC CTG GAG GTG GCA GAG AAG GAG 246 

RETLRTEADSISGRVDRLER 92 

CGG GAG ACC CTC AG A ACT GAA GCA GAC TCC ATC TCA GGA AGA GTG GAC CGT CTT GAA AGG 306 

EVDYLETQNPALPCVElT^DEK 112 

GAG GTA GAC TAT CTG GAG ACA CAG AAC CCA GCT TTG CCC TGT GTA GAG CTGy GAT GAG AAG 366 

VTGGPGA^KGKGRRNEKYDMV 132 

GTG ACT GGA GGT CCT GGA GCC AAA GGC AAG GGC CGA AGA AAT GAG AAA TAC GAT ATG GTG 4 26 

T D C S Y T V A Q V R.S M K I L K R F G 152 

ACG GAC TGT AGC TAC ACA GTC GCT CAG GTG AGG TCA ATG AAG ATC CTG AAG CGG TTT GGT 486 

G S ') V> G L W T K D P L G P A ' E K I Y V L 172 

GGT TCAV'gW GGC CTA TGG ACC AAG GAT CCG CTG GGG ICCA GCA' GAG AAG ATC TAC GTG TTA 546 

DGTQNDTAFVFPRLRDFTLA 192 

GAC GGC ACC CAG AAC GAC ACG GCT TTT GTC TTC CCA AGG CTG CGT GAC TTC ACC CTT GCC 606 

MAARKAS R IRVPF PWVGTGQ 212 

ATG GCT GCC CGG AAA GCT TCC CGA ATT CGG GTG CCC TTC CCC TGG GTA GGC ACG GGG CAG 666 

LVYG 'G FLYYARR P PGG PGGG 232 

CTG GTG TAC GGT GGC TTC CTT TAT TAT GCT CGA AGG CCT CCT GGA GGA CCT GGA GGG GGT 726 

GELENTLQLI KFHL ANRTVV 252 

GGT GAA TTG GAG AAC ACT CTG CAG CTG ATC AAA TTT CAC TTG GCA AAC CGA ACA GTG GTG 786 

DSSVFPA ESLIPPYGLTADT 272 

GAT AGC TCA GTG TTC CCT GCA GAG AGC CTG ATA CCC CCC TAC GGC CTG ACA GCA GAT ACA 846 

YIDLAADEEGLWAVYATRDD 292 

TAT ATC GAC CTG GCA GCT GAT GAG GAG GGC CTG TGG GCT GTC TAT GCC ACT CGA GAT GAT 906 

DRHLCLAKLD PQTLDTEQQW 312 

GAC AGG CAT TTG TGT CTA GCC AAG TTA GAC CCA CAG ACA CTT GAC ACA GAG CAG CAG TGG 966 



N A E 



332 
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GAC ACA CCA TGT CCC AGA GAG AAC GCA GAG GCT GCG TTT GTC ATC TGT. GGG ACC CTG TAC 1026 

VVYNTRPASRARIQCSFDA^S 352 

CTT GTC TAT AAC ACC CGC CCT GCC AGT AGG GCT CGT ATT CAG TGT TCC TTC GAT GCC AGT 1086 

GTLAPERAALSYFPRRYG. AH 372 

GGT ACT CTC GCC CCT GAA AGG GCA GCA CTC TCC TAT TTT CCA CGC CGA TAT GGT GCC CAT 114 6 

ASLRYNPRERQLYAWDDGYQ 392 

GCC AGC CTT CGC TAT AAC CCC CGT GAG CGC CAG CTG TAT GCC TGG GAT GAT GGC TAC CAG 1206 

IVYKLEMKKKEEEV* 407 

ATT GTC TAC AAA TTG GAG ATG AAG AAG AAG GAG GAG GAA GTT TAA 1251 

GCAGCTAGCCTTGTGCTCTTGATTCTTATGCCCAGACATTTATATTC 1330 

CGAAGGCCAGTGGTGGTAGCTCATATACCCTAATTTCTAAAGGACAACCAAATT 1409 

ACTCCAGATCCTGGGTAGCATTTT AG AACTG AACAGCAAACAAACACCCTAAATCTTCACTCCTGCCTTATGTCC^ 1488 

AGTTTAGTTCCAAACTCAGAGCCCTGTCCTTTGG AGAGGGTCAACCCCAGACAGCAGGCGACAGCATTCTTGCCCTCAG 1567 

TATGACCGAAGGGAGAGAACTCAG AGACAAAGCTGCCCTCCCTCCCrTCCCCCTCCAGTGTAGGGGAGAATGGGGCTTT 1646 

CCCCACATC^CTTTGTATGGTAACAGTTTGCATTAAAAGGAAAACCCA 1721 
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>mT257 

MGPSAPLLLLFFLSVn<5PLQGQ0HHLVEYMERRX*AALEERLAQCQDQSSRHAAEt»RDFKN 
KMLPLLEVAEKERETLRTEADS I SGRVDRLEREVDYLETQNPALPC VELDEKVTGG PG AK 
GKGRRNEKYDWVTDCSYTVAQVRSKKILKRFGGSVGL^ 

FVFPRLRDFTLAMAARKASRIRVPFPVA^GTGQLVYGGFLYYARRPPGGPGGGGELENTLQ 
LI KFHLANRTVVDSSVFPAESLI PPYGLTAIxrYIDLAADEEGLWAVYATRDDDRHLCLAK 
LDPQTLDTEQQV/nTPCPRENAEAAFVICGTLYVVYOTRPASRARIQCSFDA^TLAPERA 
ALSYFPRRYGAHASLRYNPRERQLYAWDDGYQIVYKLEMKKKEEEV * ^ 
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ALIGN calculates a global alignment of tvo sequences 
version 2.0uPlease citei Myers and Killer, CABIOS (1969) 

> hT2S7 a. a. 406 aa vs. 

> oT2S7a.a. 406 aa 
flooring matrix: parol 20. mat, gap penalties: -12/-4 

94.1% identity; Global alignment score: 2097 

10 20 30 40 SO' 60 70 

inputs MGPSTPLLILFLLSWSGPLQGQQKKLVEYMEWUJU^ 

MGPSAPIXLLFFLSVOXJPLQGQQKHLVEYMERRIJUU-Ee^ 

10 20 30 40 SO 60 70 

80 90 100 110 120 130 140 

inputs KEREAIJlTIlMn'ISGRVDRX.EREVDYLCTQNPA^ 

KERETIJITEADSISGRVDRLERjEVDYXCTQNP 

80 90 100 110 120 130 140 

ISO 160 170 180 190 200 210 

inputs QVRSMKILKRFGGPAGLVTTKDPLGQTEICIYVLDG 

QVRSKXILKRFGGSVGLVTTKDPLGPA^ 

ISO 160 170 180 190 200 210 

220 230 240 2S0 260 270 280 

inputs GQLVYGGFLYFAR*PPGRPGGGGEMEHTLQLIKFHLANRTVV^^ 

GQLVYGG FL YY ARR P PGG PGGGG E L EKTLQL I KFKLANRTWDSS VF P AES LIP PYGLTADTY I DLAAD E 
220 230 240 2S0 260 270 280 

290 300 310 320 330 340 350 

i nputs EGLWAVYATttEODRKUXAKUJPOTtJDT^ 

********** •••••••••••♦•••••••••••••♦.♦••■•••■••••♦•■••^••■••JJJJJJ*JJ 

EGLWAVYATRDDDRHIXrLAXI^PQTI^EQQWirrPCPRI^ 

290 300 310 320 330 340 350 

360 370 380 390 400 

i npu t s ASGTLTPERAALP YFPRRYGAHASLRYNPRERQLYAWDDGYQ IVYKLEKRXKEEEV 

ASGTIJ^ERAALSYFPRRYGAKASLRYMPRERQLYAHDDGYQIVYTCLEM^ 
360 370 360 390 400 
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ALIGN calculates a global alignment of two sequences 
version a.OuPleaae cite: My era and Killer, CABIOS (1989) 

> hT2S7 a. a. 406 aa vs. 

> Patent Protein W7S120 - (untitled) 3SS aa 
scoring matrix: paral20 .mat, gap penalties: -12/-4 

86.9% identity; Global alignment score: 1681 

10 20 30 40 50 60 70 

inputs HGPSTPLLILFIXSWSGPLQGQQHHLVTYMER^ 

::::::::::::::::::::::::::::::::::::::::::::::::;::::::::::::::::::::: 
MG PSTPLLI LFLLSWSG PLQG<X}HKLVirfMERRLAALEERI^QCQDQS^^ 

10 20 30 40 SO 60 70 

80 90 100 110 120 130 140 

inputs ICEREALRTEADTISGRVDRLEREVTJYXCTQNPAL^ 

KJ£RLAIJttTAOTISGRVDRLER£VT)YLCTQ 

80 90 100 110 120 130 140 

ISO 160 1*70 180 190 200 210 

inputs QVRSMKILKRFCGPAGLWTKDPLCOTTKIYVIXGTQ^ 

CVRSMKI LKRFGG PAG LWT KD P LGQTE KI YVI*DGTQKDT AFVF PR1-RD FTLAMAARXAS R VRVP F PWVGT 
ISO 160 170 180 190 200 210 

220 230 240 250 260 270 280 

inputs GQLVYGGFLYFARRPPGRPGGGGEMENTLQLI KFKLAKRTWDSSVFPAEGLI PPYGLTADTYIDIAADE 

GQLVYGGFLYFARRPPGRPGGGGEMEHTLQLI KFKIJu^RTVVDSSVFPAEGLI PPYGLTADTYIDLAADE 
220 230 240 2S0 260 270 280 

290 300 310 320 330 340 3S0 

inputs EGLWAVYATREDDRKIXIJUaDPW 



EGLWAVYATR£DDRHlXri-AXIJDPQTLOT 

290 300 310 320 330 340 350 

360 370 380 390 400 

inputs AS GT LT P ERAAL P Y F PRRYGAKAS LRYNPRE RQL YAWDDGYQI VYKLE74RXKE E EV 

ASGPX 
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ALIGN calculates a global alignment of two sequences 

version 2.0uPlease cite: Myers and Miller. CABIOS (1989) 
> o a . l832 aa vs. 

^ a C 0214* 1925 aa 

scoring matrix: paml20.mat. gap penalties: -12/-4 
93. S\ identity; Global alignment score: 9158 

10 20 30 40 . 50 60 

i npu t s GTCC ACCC ACGCGTCC - - - GCGG ACGCGTGCC - - TG AGGGG AAG ACCCTC ACTCTACGTTCCTTCT ACTC 

::: ::: ::: :: .:: : ::: :::::::::::.::::::::::::::::::::::*: 

CCC -CGCCTCCAAAGCTAACCCTCGGGCTTG AGGGG AAG ANGCTG ACTCTACGTTCCTTCT ACTC 

10 20 30 40 50 60 



70 80 Q0 100 110 120 130 

inputs TGGCACCACTCTCCAGGCTGCCATGGGGCCCAGCACCCCTCTCCTCATCTTGTTCCTTTTGTCATGGTCG 

TGGCACCACTCTCCAGGCrcCCATCGCCCCCAGCACCCCT 

70 80 90 100 110 120 130 



140 150 160 170 180 190 200 

i npu t s GGACCCCTCCAAGGAC AGCAGCACCACCTTGTGGAGTACATGGAACGCCGACTAGCTGCTTTAGAGGAAC 

GGACCCCTCCAAGGACAGCAGCACCACCTTGTGGAGTACATGGAACGCCGACTAGCTGCTTTAGAGGAAC 
140 150 160 170 180 190 200 



210 220 230 240 250 260 270 

i npu t s GGCTGGCCC AGTGCC AGG ACC AGAGT AGTCGGC ATGCTGCTCAGCTGCGGGACTTC AAG AACAAG ATGCT 

GGCTGGCCCAGTGCCAGGACCAGAGTAGTCGGCATGCTGCTGAGCTGCGGGA^ 

210 220 230 240 2S0 260 270 



280 290 300 310 320 330 340 

inputs -GCCACTGCTTCAGGTCGCAGAGAAGCAGCGGGAGGCACrc^ 



NGCCACTCXrrGCAGGTGGCAGAGAAGGAGCGGGAGGCACTCAGAA 

280 290 300 310 320 330 340 



350 360 370 380 390 400 410 

inputs AGTGGATCGTCTCGAGCGGGAGGTAGACTATCTCGAGA 

AGTGGATCGTCTGGAGCGGGAGGTAGACTATC^ 

350 360 370 380 390 400 4X0 



420 430 440 450 460 470 480 

inputs GATGAGAAGGTGACTGGAGGCCCTGGGACCAAAGGCAAG 

GATGAGAAGGTOACIXXSAGGCCCTGGGACCAAAGGCAAGGGAAGA^ 

420 430 440 450 460 470 480 

490 500 510 520 • 530 S40 550 



FIG. 15A 



24/61 



WO 00/78808 



PCT/US00/16883 



inputs CAGACIKTIXXXTTACACAATCTCTC^ 

CAGACTGTCCCTACACAATCTCTCAAGTGAGATCAATGAAGATTCTGAAG^ 

490 500 S10 520 530 540 550 

S60 570 580 590 600 610 620 

i npu t s TCTATCC ACCAAGCATCCACTGGCGCAAAC AGAGAAGATCTACCTGTTAG ATGGG AC ACAGAATG ACACA 

TCTATGGACCAAGCATCCACTGGGGCAAACAGAGAAGATCTACGTGTTACATGGGACACAGAATGACACA 
560 570 580 590 600 610 620 

630 640 650 660 670 680 690 

i npu t S GCCTTTGTCTTCCCAAGGCTGCGTGACTTCACCCTTGCCATGGCTGCCCGGAAAGCTTCC 

GCCTTTGTCTTCCCAAGGCTGCGTGACTTCACCC 

630 640 650 660 670 680 690 

700 710 720 730 740 750 760 

inputs TGCCCTTCCCCTGGGTAGGCACAGGGCAGCTGGTATATGGT 

TGCCCTTCCCCTGGGTAGGCACAGGGCAGCTGGTATATGGTGGCTTTCTTTATTTTGCTCGGAGGCCTCC 
700 710 720 730 740 750 760 

770 780 790 800 810 820 830 

inputs TG G AAG ACCTGGTG G AGGTGGTG AG ATGG AG AAC ACTTTGC AGCT AATC AAATTCC AC CTGGC AAACCG A 

TGGAAGACCTGGTGGAGGTGGTGAGATGGAGAACACTTTGCAGCTAAT^ 

770 780 790 800 810 820 830 

840 850 860 870 880 890 900 

inpu t s ACAGTGGTGGACAGCTCAGTATTC<XAGCAGAGGGGCTGATCCCCCCCTACGGCTTGACAG 

i ; i J i J • i ■ 

ACAGTGGTGGACAGCTCAGTATTCCCAGCAGAGGGGCTGATCCCCCCCTACGGCTTGACAGCAGACACCT 

840 850 860 870 880 890 900- 

910 920 930 940 950 960 970 

inputs ACATCGACCnXXKTAGCTGATGAGGAAGGTCTTTGGGCTGTCT 

: : : : : : : : : : : : : : : : : : : : 1 1 : 3 ; t i i : : : : " • * • • • • • * " * * * * " * * " * * 

AC ATCGACCTGGCAGCTG ATG AGGAAGGTCTTTGGGCTGTCTATGCCACCCGGG AGG ATG ACAGGCACTT 
910 920 930 940 950 960 970 

980 990 1000 1010 1020 1030 1040 

inputs GTGTCTGG(X!AAGTT AG ATCCACAGACACTGGAC AC ATGTCCCAGAGAG 
........ ......... ::::::::::::::::::sis:::::i::::::i!!S5 !!::!:!::JS5::: 

GTCTCTGGC^GTTAGATCCAC^ 

980 990 1000 1010 1020 1030 1040 

1050 1060 1070 1080 1090 1100 1110 

inputs AATGCTGAGGCTG<XTTTGTCATCTGTGGGA( 
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AATGCTCACGCTCCCTTTNTCATCTGTGCCACCCTCTATGTCGTCT 

1050 1060 1070 1080 1090 1100 1110 - 

1120 1130 1140 1150 1160 1170 1180 

inputs CCCGC ATCC ACTGCTCCTTTG ATGCCAGCGGC ACCCTC ACCCCTG AACGGGCAGC ACTCCCTTATTTTCC 

CCCGCATCC ACTGCTCCTTTG ATGCCAGCGG - ACCCTC ACCCCTG AACGCGCAGC ACTCCCTTATTTTCC 
1120 1130 1140 115u 1160 1170 1180 

1190 1200 1210 1220 1230 1240 1250 

inputs CCGCAGATATGCTGCCCATGCCACCCTCCGCTATAACCCCCGAGAACGCCACCTCTATCCCTGGGATGAT 

CCGCAGATATGGTGCCCATGCCAGCCrcCGCTATAACCCCCGAGAACGCCAGCTCTATGCCTGGGATGAT 
1190 1200 1210 1220 1230 1240 1250 

1260 1270 1280 1290 1300 1310 1320 

inpu t s GCCTACCAGATTCTCTATAAGCTGCAGATGAGGAAGAAAGAGGAGGAGGTTTG 

CCCTACCAGATTGTC7ATAAGCTCGAGATGAGGAACAAAGAGGAGGAGGTITGAOT 

1260 1270 1280 1290 1300 1310 1320 

1330 1340 1350 1360 1370 1380 1390 

i npu C s TTTGCATCTTTCTCA.CTCCC AT AC ATTTATATTATATCCCCACTAAATTTCTTGTTC^ 

TTTGCATCTTTCTCACTCCCATACATTTATATTATATCCCCACTAAATTTCTTC 

1330 1340 1350 1360 1370 1380 1390 

1400 1410 1420 1430 1440 1450 1460 

inputs TGTGGGCCAGTTGTGGCTCAAATCCTCTATATTTTTAGCCAATGGCAATC 

TGTGGGCCAGTTGTGGCTCAAATCCTCTATATTTTTAG 

1400 1410 1420 1430 1440 1450 1460 

1470 1480 1490 1500 1510 1520 1530 

inputs TTTCATACCGAACTCCAGATCCTGAGTAATCCTTTTAGAGCCCGAAGA 

• ••••■••!!IIII!IJ!IIII 

TTTCATACGGAACTCCAGATCCTGAGTAATCCITTTAGAGCCCGAA 

1470 1480 1490 1500 1510 1520 1530 

1540 1550 1560 1570 1580 1590 1600 

inputs CCIK5CTCTCCTGCCCCATGTCAACAAATTTCAGGCTAAGGAT 

CXrTGCTCTCCTGCCCCATGTCAAC^ 

1S40 1550 1560 1570 1580 1590 1600 

1610 1620 1630 1640 16S0 1660 1670 

inputs ATGCGGGCAGGCCCAGGGAGCAGGCAGCAGTGTTCTTCXX 

••••••••••••••••••••••••••••••••••••••♦•••••••••••••••■••••••••"•"i2"2" 

ATGCGGGCAGGTCCAGGGAGCAGGCAGCAGTCTTCTTCC^^ 

1610 1620 1630 1640 1650 1660 1670 
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1680 1690 1700 1710 1720 1730 1 7 «> 

inpu t s AGGAGACCrrCCAGCTCTGTCCTCTCTTCCTCACTCCTCCCTTC 

AGGAGACGTCCAGCTCTGTCCTCTCTTCCTCACTCCTCCCT^ 

1680 1690 1700 1710 1720 1730 1740 

1750 1760 1770 1780 1790 1800 1810 

inputs ACATTGTTTTGTATTGCAACATTTTGCATTAAAAGGAAAATCCA 

ACATTGTTTTGTATTGCAACATTTTGCATT 

1750 1760 1770 1780 1790 1800 1810 



1820 1630 
inputs AAAAAAAAGG GCGGCCGC- 



AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

1820 1830 1840 1850 1860 1870 1880 



inputs 



TC GTCTTCTCGC AGCCGT ACCCTTC TGTCGTCTTCTCGC AGCC 
1890 1900 1910 1920 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> mT2S7a.a. 406 aa vs. 

> Patent Protein W75120 - (untitled) 355 aa 
scoring matrix: paml20.mat, gap penalties: -12/-,4 

81.8% identity; Global alignment score: 1599 

10 20 30 40 50 60 70 

inputs MGPSAPLLLLFFLSWTGPLQGQQHHLVEYMERRIJ\ALEERI^ 

MGPSTPLLILFLLSWSGPJ^QQHHLTOYMERRLAALEERIAQCQDQSSRHAAELRDFKNKMLPLLEVAE 
10 20 30 40 50 60 70 

80 90 100 110 120 130 140 

inputs KERETLRTEADSISGRVTDRLEREVT3YLETQNPALPCVE 

KEREALRTEADTISGRVTJRLERETOYLETQNPALPCVXFDEKVTGGPGTKGKGRRNEKYDMVTDCGYTIS 
80 90 100 110 120 130 140 

150 160 170 180 190 200 210 

inputs QVRSMKILKRFGGSVGLWTKDPLG PAEKI YVLIX3TQNDTAFVFPRLRJ)FTLA>IAARKASRIRVPFPWVGT 

QVRSMKILKRFGG PAGLWTKDPLGQTEKIYVXDGTQtroTAFVFPRLRDFTLAMAARKASRvliVPFPWVGT 
150 160 170 180 190 200 210 

220 230 240 250 260 270 280 

inputs GQLVTGGFLYYARRPPGGPGGGGELEtrTLQLIKFHLANRTWDSSVFPAESLIPPYGLTADTYIDLAADE 

GQLV^GGFLYFARRPPGRPGGGGEMENTLQLIKFHLANRT^ 

220 230' 240 250 260 270 280 

290 300 310 320 330 340 350 

nputs EGLWAV^ATRDDDRHLCLAKLDPQTLDTEQQWDTPCPRENAEAAFVICGTLYVvyNTRPASRARIQCSFD 

EGLWAV^ATREDDRHUTLAKiDPQTLDTEQQWDTPCPRENAEAAFVICGTLYWYNTRPASRARIQCSFD 
290 300 310 320 330 340 350 

360 370 380 390 400 

nputs ASGTLAPERAALS YFPRR YGAHAS LRYNPRERQLYAWDDGYQIVYKLEMKKKEEEV 

ASGPX 
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ALTON calculates a global alignment of two sequences 
version 2.0uPleaee cite: Myers and Miller, CABIOS (1989) 

> WT257 n.a. 1721 aa vs. 

> Patent nucleotide V34217 - (untitled) 1890 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

76.2% identity; Global alignment score: 64 93 

10 20 
inputs GT CGACCCAC GCGTCC G ACTTAAGG 

GAGCAGGAGAGAAGGCACCGCCCCACCCCGCCTCO^AAGCTAACCCTCGGGCTTCAGGGGAAGAGGCT 

10 20 30 40 50 60 70 

30 40 SO 

i npu t s - CTGCCATGGGGCCCAGTGCTCCTCTGCTGCTCCT 

CTGTACGTTCCTTCTACTCTGGCACCACTCTCCAGGCTGCCATGGGGCCCAGCACCCCTCTCCTCATCTT 
80 90 100 110 120 130 140 

60 70 80 90 100 110 120 

i npu t s CTTCTTTTTGTCATGGACGGGACCCCTTCAGGGACAGCAGCACCACCTTGTGGAGTACATGGAACGCCGA 

GTTCCTTTTG TC ATGGTCGGG ACCCCTCC AAGG ACAGC AGC ACCACCTTGTGG AGTACATGG AACG CCG A 
150 160 170 180 190 200 210 

130 140 150 160 170 180 190 

inputs CTAGCTGCCTTAG AGG AACGGCTGGCCCAATGCC AGG ATCAGAGTAGTCGGCATGCTGCCG AGCTTCGGG 

CTAGCTGCITTAGAGGAACGGCTGGCCCAGTGCCAGGACCAGAGTAGTCGGCATGCTGCTCAGCTGCGGG 
220 230 240 250 260 270 280 

200 210 220 230 240 250 260 

i npu C s ACTTC AAAAACAAG ATCTTGCCTCTCCTGGAGGTGGCAGAGAAGGAGCGGGAG ACCCTCAG AACTG AAGC 

ACTTCAAGAACAAGATGCTGCCACTGCTGGAGGTGGCAGAGAAGGAGCGGGAGGCACTCAGAACTGAGGC 
290 300 310 320 330 340 3S0 

270 280 290 300 310 320 330 

inputs AGACTCCATCTCAGGAAGAGTGGACCGTCTTGAAAGGGAGGTAGACTATCTGGAGACACAGAACCCAGCT 

CGACACCATCTCCGGGAGAGTGGATCX3TCTGGAGCGGGAGGTA 

360 370 380 390 400 410 420 

340 350 360 370 380 390 400 

Inputs TTG CCCTGTGT AG AGCTGG ATG AG AAGGTG ACTGG AGGTCCTGG AGC CAAAGG CAAGGG CCG AAG AAATG 
.............. . ..... ;;::::::::::::: ::*•:••::•:•*::!:•*" •;::••::: 

CTCCCCTGTGTAGAGTTTGATC 

430 . 440 4S0 460 470 480 490 

410 420 430 440 450 460 470 

nput s AGAAATACGATATGGTGACGGACTGTAGCTACA<IAGTCGCTCA 

::: » . :i x t ::::::::::.::::::.: J ::::::.: : s s .:::::::: t s : i :::::: : 

AGAAGTACGATATGGTGACAGACTGTGGCTACACAATCTCTC^ 

S00 510 520 530 S40 S50 560 

480 490 500 510 520 S30 540 

nputs GTTTGGTGGTTCAG7^TGG<CTATGGACCAAG^ 

.:::::::: ::: ::: i ::::::::::::::::-::::: i : 
ATTTGGTGGCCCAGenxrrCTATG^ 

S70 580 590 600 610 620 630 

550 560 570 580 590 600 610 

nputs GGCACCatt3AACGACJu2GGCTTTTGTCTTC 

ti it i s s s t itni.ti :r:::sititmtii*ittt*is»ssttn::ss: S ^^^^^^ A 

640 6S0 ^ 660 670 4>80 690 700 

620 630 640 650 660 €70 680 • 

lputs AAGCTTC<XXJAATTCOaOTOC<Xr^^ 

:::::::::::.; : : : 1 1 1 : n n n : i m I ttt t titj • iiiiiiitin.ii 1 5 :::**: : : i : * : 
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AAOCTTCCCOAarCCCXWTGC^^ 

710 720 730 740 750 760 770 

690 700 710 720 730 740 750 

inputs TTATGCrOJAAGGCCTCCIXXJAGGA^ 

x : . ::::ii. ::::::::::::. ::::.:::.::.::ts:::r..:::::sst!S! iz:ii''"' ii:i ^ 
TTTTGCTCGGAGGCCTCCTGaAAGACCTGGTGGAGGTXXr^ 

780 790 800 810 820 830 840 

760 770 780 790 800 810 820 

i npu t s TTTCACTTGGCAAACCGAACAGTGCTGGATAGCTCAGTGTTC 

^CCACCTGGCAAACCGAACAGTGGTGGACAGCTCAGTATTCCCAGCAGAGGGGCTGATCCCCCCCTACG 
850 860 870 880 890 900 910 

830 840 850 860 870 880 890 

inputs GCCTG ACAGCACATACATATATCC ACCTGGCAGCTG ATGAGGAGGGCCTGTGGGCTGTCTATGCC ACTCG 

GCTTGACAGCAGACACCTACATCGACCTGGCAGCTGATGAGGAAGGTCTTTGGGCTGTCT 

920 930 940 950 960 970 980 

900 910 920 930 940 9S0 960 

inputs AGATGATGACAGGCATTTGTGTCTAGCCAAGTTAGACCCACAGACACTTCACAC^ 

.. ........... ::::::::::: ::::::::::::::::::=:: 

GG AGG ATG AC AGG CACTTGTGTCTGG CCAAGTTAG ATCC ACAGAC ACTGG AC AC AG AGC AG C AGTGGG AC 
990 1000 1010 1020 1030 1040 1050 

970 980 990 1000 1010 1020 1030 

i npu t s ACACCATGTCCCAGAGAGAACGCAGAGGCTGCGTTTGTCATCTGTGGGACCCTGTACGTTGTCTATAACA 

ACACCATGTCCCAGAGAGAATGCTGAGGC^ 

1060 1070 1080 1090 1100 1110 1120 

1040 10S0 1060 1070 1080 1090 1100 

i npu t s CCCGCCCTGCCAGTAGGGCTCGTATTCAGTGTTCCTTCGATGC 

:::: ::::::::: :::: :: :: ::::: ::::: :::::::: :::: 
CCCGTCCTGCCAGTCGGGCCCGCATCCAGTGCTCCTTTGATGCCAGCGG - ACCCTG ACCCCTG AACGGG C 
1130 1140 1150 1160 1170 1180 

1110 1120 1130 1140 1150 1160 1170 

i nput S AGCACTCTCCTATTTTCCACCCCGATATGGTGCCCATGCCA 

AGCACTCCCTT ATTTTCC^ C AG AT ATGGTG CCC ATGCC AGCCTCCG CTAT AACC CCCG AG AACG C CAG 
1190 1200 1210 1220 1230 1240 1250 

1180 1190 1200 1210 1220 1230 1240 

inputs CTGT ATG CCTGGG ATG ATGGCTACXAG ATTGTCTACAAATTGG AGATG AAG AAG AAGG AGG AGO AAGTTT 
• ; t : ! i i J i i i ••■•«■•■•' • • * • sssijjssi«x52S!S»»""-**»*** * * * 

CTCT ATG CCTGGG ATG ATGGCT ACX^G ATTGTCT AT AAGCT<K5 AGATGAGG AAG AAAGAGG 
1260 1270 1280 1290 1300 1310 1320 

1250 1260 1270 1280 1290 1300 1310 

inputs AAGCAG CT AGCCTTGTGCT CTTGATTCTTATGCCCAGACATTTATATT CCTGTGAGCTCTCC 

.xx :::::::::::: : : :::: :s:x::::::: a • * 55 

GAGGAGCTAJGCXrrT<nrrTrrTG<^ 
1330 1340 1350 1360 1370 1380 1390 

1320 1330 1340 1350 1360 1370 

inputs TGOuTTTCATCCrrCAAAACGA^ AGGACAA 

: x . ::::x:t. x: ?::: stts.:: : :t:.. :t : « : = 

TGTTC<rrCATTCTTCAAAT-GTGGGCCAGT^ CICAAATCCTCTATATITTTAGCCAATGGCAA 

1400 1410 * 1420 1430 14<t> 1450 1460 

1360 1390 1400 1410 1420 1430 1440 

inputs CX^IAATTCTCA> AQCXXX'ICIU 111 1 AT<K^U3AACT<X^^1CCTOQ<?TA0 -CATTITAflAACTQAACAG 
t suss si • tit tss iiui tt«t«iiuiuuiiiiii.itt* 1 ***** ** mt 1* ss 
TCAA A I l tlTT l X ^ lUX XTU ^ 
1470 1480 1490 . 1500 1510 1520 1530 



1450 1460 147^0 1480 1490 
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inputs CAAACAAACACCCTAAAT CTTCACTCCTGCCTTATGTCCACAAAGTT TAGTT CC 

.'. :::: : :: ::::: ::::: :: :: 

AGTCAAAACCCTCAATGTTCCCTCCTGCTCTCCTGCCCC^ 

1540 1550 1560 1570 1580 1590 1600 

1500 1510 1520 .1530 1540 1550 1560 

inpu t S AAACTCAGAGCCCTGTCCTTTGGAGAGGGTCAACCCCAGACAGCAGGCX3ACAGCATTCTTGCCCTCAGTA 

AGACCCAGGGCTCTAACCTTGTATGCGGG - CAGGCCCAGGG AGCAGGCAGCAGTGTTCTTCCCCTCAGAG 
1610 1620 1630 1640 1650 1660 1670 

1570 1580 1590 1600 1610 1620 
inputs TGACC - GAAGGG AG AGAACTCAG AG A CAAAGCTGCCCTC CCTCCCTTCCCCCTCC AGTG 

TGACTTGGGGAGGGAGAAATAGGAGGAGACGTCCAGCTCTGTCCTCTCTTCCTCACTCCTC 

1680 1690 1700 ' 1710 1720 1730 1740 

1630 1640 1650 1660 1670 1680 1690. 

i npu t S TAGGGGAGAATGGGGCTTTCCCCACATCACTTTGTATGGTAACA - " 

tcctgaggaacaggactttctccacattgttttgtattc^ 

1750 1760 1770 1780 1790 1800 1810 

1700 1710 1720 
inputs CAAAAAAAAAAAAAAAGGG CGGC CG 

CAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACGGCACGAGGGGGGGTCCCGTACCCAATNGCCCTCA 
1820 1830 1840 1850 I860 1870 I860 

inputs C 



CATGCAT 
1890 
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GTCGACCCACX3CGTOCNTCCAGCX3TNCGGAG 79 

M 1 

CGC^GCCTCGGCACCTGCAGGTCCGTGCGTCCCGC^CTGGCGCCCCTC ATG 155 

IS LPGP L VTNLXRFLFLGLS 21 

ATT TCC CTC CCG GGG CCC CTG GTG ACC AAC TTG NTG CGG TTT TTG TTC CTG GGG CTG AGT 215 

ALAP PS R AQLQLHLPANRLQ 41 

GCC CTC GCG CCC CCC TCG CGG GCC CAG CTG CAA CTG CAC TTG CCC GCC AAC CGG TTG CAG 275 

AVEEGES GASAWYTLHREVS 61 

GCG GTG GAG GAG GGG GAA AGT GGT GCT TCA GCA TGG TAC ACC TTG CAC AGG GAG GTG TCT 335 

SSQPWEV PFVMWFFKQKEKE 81 

TCA TCC CAG CCA TGG GAG GTG CCC TTT GTG ATG TGG TTC TTC AAA CAG AAA GAA AAG GAG 3 95 

D.QVLSY I NGVTTS KPGVS LV 101 

GAT CAG GTG TTG TCC TAC ATC AAT GGG GTC ACA ACA AGC AAA CCT GGA GTA TCC TTG GTC 4 55 

YSMPSRN. LSLRVEGLQEKDS 121 

TAC TCC ATG CCC TCC CGG AAC CTG TCC CTG CGG GTG GAG GGT CTC CAG GAG AAA GAC TCT 515 

GPYSCS VNVQDKQGKSRGHS 141 

GGC CCC TAC AGC TGC TCC GTG AAT GTG CAA GAC AAA CAA GGC AAA TCT AGG GGC CAC AGC S75 

IKTLELN VLVPPAP PSCRLQ 161 

ATC AAA ACC TTA GAA CTC AAT GTA CTG GTT CCT CCA GCT CCT CCA TCC TGC CGT CTC CAG 635 

GVPHVGANVTLSCQSPRSKP 181 

GGT GTG CCC CAT GTG GGG GCA AAC GTG ACC CTG AGC TGC CAG TCT CCA AGG AGT AAG CCC 695 

A V Q YQ W D RQL PS FQTFFAP A 201 

GCT GTC CAA TAC CAG TGG GAT CGG CAG CTT CCA TCC TTC CAG ACT TTC TTT GCA CCA GCA 755 

LDVIRGS It S LTKL S SSMAG V 221 

TTA GAT GTC ATC CGT GGG TCT TTA AGC CTC ACC AAC CTT TCG TCT TCC ATG GCT GGA GTC 815 

YVCKAHN EVGTAQCNVTLEV 241 

TAT GTC TGC AAG GCC CAC AAT GAG GTG GGC ACT GCC CAA TGT AAT GTG ACQ CTG GAA GTG 875 

STGPGAAVVAEAVVGTIiVG L 261 

AGC ACA GGG CCT GGA GCT GCA GTG GTT GCT GAA GCT GTT GTG GGT ACC CTG GTT GGA CTG 935 

G L L A G LV XjLYHRRGKALE E P 281 

GGG TTG CTG GCT GGG CTG GTC CTC TTG TAC CAC CGC CGG GGC AAG GCC CTG GAG GAG CCA 995 

AND I K E D A IAPR? L PWPKS S : . 301 
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GCC AAT GAT ATC AAG GAG GAT GCC ATT GCT CCC CGG ACC CTG CCC TGG CCC AAG AGC TCA 1055 

DTISKNGT.LSSVTSARALR — P 321 

GAC ACA ATC TCC AAG AAT GGG ACC CTT TCC TCT GTC ACC TCC GCA CGA GCC CTC CGG CCA 1115 

PHGPPRPGALTPTPSLS S QA 341 

CCC CAT GGC CCT CCC AGG CCT GGT GCA TTG ACC CCC ACG CCC AGT CTA TCC AGC CAG GCC 117 5 

LPS PRHA HDRWGPPSTNI PH 361 

CTG CCC TCA CCA AG A CAT GCC CAC GAC AG A TGG GGC CCA CCC TCA ACC AAT ATC CCC CAT 123 S 

PWWGFFLWL* 371 
CCC TGG TGG GGT TTT TTC CTT TGG CTT TGA 1265 



GCCGCATGGGTGCTGNGCCTGTGATGGNGCCTGCCCAGAGTCAAGCTGGCTCTCTGGTATGATGACCCCACCACTCATT 134 4 

GGCTAAAGGATTTGGGGTCTCTCCTTCCrATAAGGGTCACCTCTAGCACAGAGGCCTGAGT 14 2 3 

TCCIG ACCCITAGTACTCIX3CCCC<^CCrCTCIT^ 1502 

AGAAGGAGAAGAGGAAGTGGATCTGGAATTGGGAGGAGCCTCCACCCACCCCTGACT 15 81 

jAAATTAGCTACTGACCAAGAGTGAGGGGCAGAGACTTCCAGTCA 166 0 

rCCCTATCTAACACCACCCTTGGCTCCCACTCCAGCTCCCTGTATTC 1739 

rACTGGGGCAGAGGATAGGGAATCTCTTATTAAAACTAACATGAAA 1818 
GATACATAATGTTTGTATGAGATAAGAAAAAAAAAAAAAAAGGGCGGCCGC 1869 
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Kflv 




k ^ 



I I 



I ' 1 ' I < I < I ' I ' I ' I ' I ' I ' I ' I ' I 1 I 1 I ' I 1 I ' I ' I ' 
1 41 81 121 1G1 201 241 281 321 361 



MISLPGPLVTNLXRFLFLGLSALAPPSRAQI/JLHLPANRLQAVEEGESGASAWyTLHREV 
SSSQPWEWFVMWFFKQKEKEDQVLSYINGVTTSKPGVSLVYSMPSRNIiSLRVEGLQEKD 
SGPYSCSVNVQDKQGKSRGHSIKTLELNVLVPPAPPSCRLQGVPHVGANVTLSCQSPRSK 
PAVQYQV?DRQLPSFQTFFAPALDVIRGSLSLTNLSSSMAGVyVCKAHNEVGTAQCNVTLE 
VSTCPGAAWAEAWGTLVGLGLLAGLVLLYHRRGKALEEPANDIKEDAIAPRTLPWPKS 
SDTISKNGTLSSVTSARALRPPHGPPRPGALTPTPSLSSQALPSPRHAHDRWGPPSTNIP 

HPWWGFFLWL 
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(HKXIACCCACGCGTCCGGTGCA 7 9 

MILQAGTPETSLL 13 

CCTGAGTACTCCGGGCCAAGGAGGGCC ATG ATT CTT CAG GCT GGA ACC CCC GAG ACC AGC TTG CTG 14 5 

RVLFLGLST LAAFSRAQMEL 33 

CGG GTT TTG TTC CTG GGA CTG AGT ACC CTT GCT GCC TTC TCC CGA GCT CAG ATG GAG TTG 20 5 

HVPPGLNKLEAVEGEEVVLP S3 

CAC GTG CCC CCG GGC CTC AAC AAA TTG GAA GCG GTA GAG GGA GAA GAA GTG GTG CTC CCC 265 

AWYTMAREESWSHPREVPIL 73 

GCC TGG TAC ACG ATG GCA CGG GAG GAG TCG TGG TCC CAC CCC CGG GAG GTG CCC ATC CTG 325 

IWFLEQEGKEPNQVLSYING 93 

ATC TGG TTC TTG GAA CAA GAA GGG AAG GAA CCA AAC CAG GTG TTG TCT TAC ATT AAT GGA 385 

VMTNKPGTALVHSISSRNVS 113 

GTC ATG ACA AAT AAA CCT GGA ACA GCC CTG GTC CAC TCT ATC TCT TCA CGG AAT GTG TCC 44 5 

LRLGALQEGDSGTYRCSVNV 133 

CTG CGC CTG GGG GCA CTC CAG GAG GGA GAC TCT GGG ACT TAC CGC TGT TCT GTC AAT GTG SOS 

q'hDEGKSIGKSIKSIELKVL 153 

CAG AAT GAT GAA GGC AAA AGT ATA GGC CAC AGC ATC AAA AGC ATA GAG CTC AAA GTG CTG 565 

VPPAPPSCSLQGVPYVGTNV 173 

GTT CCT CCA GCT CCT CCA TCC TGT AGT TTA CAG GGT GTA CCC TAT GTC GGG ACC AAT GTG 625 

TLKCKSPRS KPTAQYQWE R I* 193 

ACC CTG AAC TGC AAG TCC CCA AGG AGT AAA CCT ACT GCT CAG TAC CAG TGG GAG AGG CTG 685 

APSSQVF FG PA LD AVRGS L K 213 

GCC CCA TCC TCC CAG GTC TTC TTT GGA CCA GCC TTA GAT GCT GTT CGT GGA TCT TTA AAG 745 

LTNLSIAMSGVYVCKAQNRV 233 
CTC ACT AAC CTT TCC ATT GCC ATG TCT GGA GTC TAT GTC TGC AAG GCT CAA AAC AGA GTG 



605 



GFAKCNVTLDVMTGSKAAVV 253 

3GC TTT GCC AAG TGC AAC GTG ACC TTG GAC GTG ATG ACA GGG TCC AAG GCT GCA GTG GTC 865 

AGAVVGTFVGLVLIAGliVfcL 273 

XTT GGA GCA GTT GTG GGC ACT TTT GTT GGG TTG GTG CTG ATA GCT GGG CTG GTC CTG TTG 925 

YQRRSKTLEELANDIKEDAI 293 

CAC CAG CGC CGG AGC AAG ACC TTG GAA GAG CTG GCC AAT GAT ATC AAG GAA GAT GCC ATT 985 

APRTIjPWTKGSDT I SKNGTL 313 

KTT CCC CGG ACC TTC CCT TGG ACC AAA GGC TCA GAC ACA ATC TCC AAG AAT GGG ACA CTT 1045 
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SSVTSARA LRppKAAP PRPG 333 

TCT TCG GTC ACC TCA GCA CGA GCT CTG CGG CCA CCC AAG GCT GCT CCT CCA AGA CCT QQC 1105 

TFTPTPS V SS QAL S S PRLPR 353 

ACA TTT ACT CCC ACA CCC AGT GTC TCT AGC CAG GCC CTG TCC TCA CCA AGA CTG CCC AGG 1165 

VDEPPP QAVSLTPGGVSSSA 373 

GTA GAT GAA CCC CCA CCT CAG GCA GTG TCC CTG ACC CCA GGT GGG GTT TCT TCT TCT GCT 1225 

LSRMGAV P VMVPA QSQAGSL 393 

CTG AGC CGC ATG GGT GCT GTG CCT GTG ATG GTG CCT GCA CAG AGT CAG GCT GGG TCT CTT 128 5 

V 395 

GTG TGA 1291 

TAGCCCA.GGCACTCATTAGCTACATCTGGTATCTGACCTTTCTGTAAAGGTCTCCTTGTG 1370 

3GGAGGATGCCCAGATTCTAGACCTCCAGTCCTTTGCTCCTACCT 144 9 

7TAAAATCTXXX3TCAAAGGACAAAAGGAGGAAATGGACCTGAGGTAGGGG 1528 

rTGCTTCTCCCTG AAGCCAGATG AATGCTGCGG AAG ATCGGCTACCCTCCAAGGGCTCTGGAGG AG ACTGCCAGTCAGT 1607 

^ATGCCCCTGGCTCrGTGATCTGTACAACACCCTTATCT^ 1686 

ATAACCTGTCCTGCTGGCITGGCTGGGTTTTGTTGTAGCAGGG 1765 

oATGTTTTTGTTTTTATTTTGCAAATTTCAATAAAGAT 1844 

1646 
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>mT258 

MILQAGTPCTSLLRVLFLX3LSTLAAFSRAQMEL 

EESWSHPRE^PILIWFLEQEGKEPNQVLSYIIKSVMTNKPGTALVHSISSRWSLRI^AW 
EGDSGTYRCSVNVQNDEGKSIGHSIKSIELKVLVPPAPPSCSLQGVPYVGTNVTLNCKSP 
RSKPTAQYQWERLAPS SQVFFG P ALDAVRGS LKLTNLS I AMSG VYVC KAQNRVG FAKCNV 
TLDVMTGS KAAVVAG AVVGTFVGLVLI AGLVLLYQRRS KTLE ELANDI KEDAI APRTLPW 
TKGSOTISKNGTLSSVTSARALRPPKAAPPRPGTFTPTPSVSSQALSSPRLPRVDEPPPQ 

AVSLTPGGVSS S ALS RMG AVPVMVPAQSQAGSLV 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPleaae cite: Kyere and Killer, CABIOS (1989) 

> hT2S8a.a. 370 aa vs. 

> BT25B a. a. 394 aa 

scoring natrix: paml20.cnat. gap penalties: -12/-4 

62.6% identity; Global alignment score: 108S 

10 20 30 40 50 60 

inputs MISLPCPLVTHIJCRFLFIGLSALAPPSRAQLOLHLPA- -imUJAVEEGESGASAWYTLHREVSSSOPWEV 

MI LQACTPErrSlXRVLFU3LSTLAAFSR^^ 

10 20 30 40 SO 60 70 

70 80 90 100 110 120 130 

inputs PFA/MWFFKQK£K£-DQVI>SYIN<7VTTSKP 

PILIWFLEQEGKEPHQVI^YIHGWrraKPGTA^ 

80 90 100 110 120 130 140 

140 150 160 170 180 190 200 

inputs RGHSI)OT.EUm-VPPAPPSCRl>QC^ 

XGKS I KS X ELJCVLVP PAP PSC5 I>GGVFYVGTNVTLNCICS PRS KPTA0YQWER1APSSQVFFGPALDAVRG 
ISO 160 170 180 190 200 210 

210 220 230 240 250 260 270 

inpu t s SLSLTNLS S SKAGVYVCKAKN EVGTAQCNVTLEVS TG PGAA WAEAWGT L VG LG LLAG L VLL YKRRG KA 

SLKLTHI-S I AMSGVYVCKAQNRVG F AXCNVTLD VMTCS KAAWAGAWGTFVGLVLIAGLVLLYQRRSKT 
220 230 240 250 260 270 280 

280 290 300 310 320 330 340 

inputs LE E P AND I KEDAI APRTL PWP KS SOT I S KHGTLS SVTS ARALRP PHG - P PRPG ALTPTP S LSSQ ALP S PR 

LEELANDIK£DAIAPRTLPVn*KGSOTISKNGTI^Sra 

29C 300 310 320 330 340 350 

350 360 370 

inputs HAH - DRKGPPSTNI PHPWWGFFLWL 

LPRVDEPPPOAVSLTPGGVSSSALSRMGAVPVMVPAQSQAGSL-V 
360 370 380 390 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hT256a.a. 370 aa vs. 

> SwissProt Q99795 - (untitled) 319 aa 
scoring matrix: paml20 .mat, gap penalties: -12/-4 . . 
23.0% identity; Global alignment score: -102 

10 20 * 30 40 50 60 

inputs MISLPGPLVTNLXRFLFLGLSAJLAPPSRAQLQLHLPANRLQAVEEG-ESGASAWYTLHREVSSSQPWEVP 

WGKMWPV^WTLCA - VI* VTVT>^ 

10 20 30 40 50 60 

70 80 90 100 110 120 130 

inputs FVWWFFKQKEKEDQVLSYINGVTTSKPGVSLVT 

WIWPFSNKN YIHG-ELYKNRVSISNNAEQSDASITIDQLTMADNGTYECSVSLMSDLE G 

70 80 90 100 110 120 

140 150 160 170 180 190 200 

inputs HS IKTLELNVLVPPAPPSCRLQGVPKVGANVTiSCQSPRSKPAVQYQWDR- -QLPS FQTFFAPALfDVI RG 

NTKSRWLLVXVPPSKPECGIEGETIIGNNIQLTCQSKEGSPTPQYSWKRYNII^QEQPLAQPASGQ 

130 140 150 160 170 180 190 

210 220 230 240 250 260 270 

.nputs SLSLTNLSSSMAGVYVCKAHNEVGTAQChATTLEVSTC 



PVSLkNISTDTSGYYICTSSNEEGTQFCNITVAVRSPSMNVALYVGIAVGVVAALIIIGIIIYCCCCRGK 
200 210 220 230 240 250 260 

280 290 300 310 320 330 340 

nputs ALEEPANDIKEDAIAPRTLPWPKSSDTISKNGTLSSVTSARALRPPHGPPRPGALTPTPSLSSQALPSPR 

- -DDNTED-KEDA RPNREAYEEP- PEQLRELSREREEE -DDYR 

270 280 290 300 

350 360 370 

iputs HAHDRWGPPSTNI PHPWWGFFLWL 

QEEQR - - STGRES PDH LDQ 

310 
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ALI0N calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hT258 n.a. 1869 aa vs. 

> GenBank U7972S - Human A3 3 antigen precursor mR 2793 aa 
scoring matrix: paml20.mat. gap penalties: -12/-4 

40.$% identity; Global alignment score: 1182 

10 20 

inputs GTCCACCC ACGCGTNCNT CCAG- C 

- - CTACCCCTTTGTG AG CAGTCTAGG ACTTTGTACACCTGTTAAGT AGGG AG AAGG CAGGGG AGGTGGCT 
10 20 30 40 50 60 

30 40 50 

i npu C s GTNC GGAG - CCGC CCT GGGTGTCA - GCG - GC 

GGTTTAftGGGGAACTTGAGGGAAGTAGGGAAGACTCCTCTTGGGACCTTTGGAGTAGGTGACACATGAGC 
70 80 90 100 110 120 130 

60 70 80 90 
inputs TCGGCTCCCGCCCAC- - GC TCCGGCCGT CGCGC-AGCCT CGGCA C C 

CCAGCCCCAGCTCACCTGCCAATCCAGCTGAGGAGCT 
140 ISO 160 170 1B0 190 200 

100 110 120 
inputs - TGC AGG TCC GTGC - - GTCCCG CGGCTGGCGCC CCTG 

GGGTGAGAAGAGGGA^AATTGCAGGGACCTCCAGTTGGGCCAGGCCAGAAGCTGCTGTAGCTTTAACCAG 
210 220 230 240 250 260 270 

130 140 150 
inputs AC TCCGTCC CGGCCAGGGA GGGC CATGA 

ACAGCrCAGACCrGTCTGGAGGCTGCCAGTGACAGGTTAGGTTTAGGGCAGAGAAGAAGCAAGACCATGG 
280 290 300 310 320 330 340 



160 170 180 
inputs TTT CC CTCCCGGGGCC - - CCTGGTG ACCAAC TTGN 

TGGGGAAGATGTGGCCTGTGTTGTGGACACTCTTGTC 
350 360 370 380 390 400 410 



190 200 210 220 
inputs TGC GGTTTTTGTTC CTGGGGCTGAGTG CCCT- C- - -GCGCC- -CC- - - 

AACTCCGCAGGACGTTCITCX3GGCTTCGCAGG 
420 430 440 450 460 470 480 

230 240 250 260 

inputs -CCTC GCGGGCC CA GCTGCAACT -G CACTTG C -CCGCC 

:::: ::::: :: ::::*s:.x.t.i.: : : 

ACCTC<!AGTCX3 AG AGGG ACTTATTCAATG^ 
490 500 510 S20 530 540 550 

270 280 290 300 310 320 

inputs AAC<X5GTTGCAGGCGGTGG AGGAGGG G<3AAAUl W lX3CiriCAGCATGGTACAarrTGC 

. . : : : :: : : . . : :::::::. it .::: 

GGCCGTTTTCAAAC^UU^AACrACATCCATGG^ 
S60 570 S80 590 600 610 620 

330 340 356 360 

inputs A CAGG<1AGGTC<TCTTCATC-CCA GCCATGGGAGG TGC-CCTT- -TGTGATGT 

i . : : :s tut tit .null • s: in. • : : t . * s 

TGAOCAGTCCGATG<XrrC<^TCACXAT^ 
630 640 650 660 670 660 690 



370 360 390 400 410 

inputs GGTTCT TCAAAC - - AGAAAQAAAAGGAGGATCAGGTGT TGTCCT 

i j: sss.is .::,.: tt t . .••sis ssss ::::: 
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<nXTTCCCTOATCrrCAOACCTX^AGGOCAACACC^ 
700 710 720 730 740 7S0 760 

420 430 440 

inputs --ACATCAA TGGGGTCA -CAACAAGCAAACCTG - - GAOTATC 

CCAAACCAGAATGCXSGeATCGAGGGAGAGACXrATAATTGGGAAC^ 
770 780 790 800 8X0 820 830 

450 460 470 480 

inputs CTTGGTCT ACTCCATG CCCTC CCGGAA CCTGTC 

GGAGGGCT - CACCAACCCCTCAGTACAGCTGGAAGAGGTACAACATCCTGAATCAGGAGCAGCCCCTGGC 
840 850 860 870 860 890 900 

490 500 S10 520 
i nputs CCTGC GGGT - GG AGGGTCTCC - AGG AG AAAGACTC TGGCCC CTACAGCTG - 

CCAGCCAGCCTCAGGTCAGCCTGTCTCCCTGAAGAATATCTCCACAGACACATCGGGTTACTACATCTGT 
910 920 930 940 950 960 970 



S30 540 550 560 S70 
inputs - - CTCCG TG AATGTGC AAG ACAAACAAG - - -GCAA - ATCTAGG - GGCCA - CAG CAT CA 

ACCTCCAGCAATGAGGAGGGGACXSCAGTTCTGC^A 

980 990 1000 1010 1020 1030 1040 

580 S90 600 610 620 
inputs AAACC TTAG AACTC AATGTAC - TCGTT CCTC CA GCTCCTCCATC - - CTG 

TGGCCCTGTATGTGGGCATCGCGGTGGGCGTGGTTGCAGCCCTCATTATCATTGGCATCATCATCTACTG 
1050 1060 1070 1080 1090 1100 1110 

630 640 6S0 660 670 

inputs C- - CGTCT - CCAGGGTGTG - - C CCCATGTG GGGGCAAACGTGACC - - - CTG AGCTGCCAGT 

CTGCTGCTCCCGAGGGAAGGACCACAACACTGAAGA 
1120 1130 1140 1150 1160 1170 1180 

680 690 700 710 
■nputs - - CTCCAAGG - AG - TAAGCCCGCTGTCCAA TAC- - -CAGTGGG 

G AGG AG CC ACC AG AG C AGCTAAG AG AACTTTCCAG AG AG AGGG AGG AGG AGGATG ACTACAGG C AAG AAG 
1190 1200 1210 1220 1230 1240 1250 

720 730 740 750 760 

nputs ATCGG CA GCTTCCATCC- - -TTCCAG-ACTTTCTTTG-CA- - CCAG CATT AG ATGTCATCC 

AGCAGAGGAGCACTG^XXXTIXjAATCCCCXXJA 
1260 1270 1280 1290 1300 ■ 1310 1320 

770 760 790 600 810 620 

nputs GTGGG- TCTTTAAGCCTCA-CCAA CCTTTCOTCITCCATGGCTGG AGTCTA - TGT 

GAGG AAGGGTTAGG<3GTTCATTCTCCC<5C^ 
1330 1340 13S0 1360 1370 1380 1390 

830 840 650 860 670 
nputs CTGCAAGGCCCA - - CAATGAGGTGGGCA- - - CTGCCC - AATGTAA - -TGTG AC GCTGG 

J . • • • • a m m m m • • • . . • • • «tt ■ • • • • * ! . • J ' ! * 

* - • » • • • . • m # • « • • • • • • « * * « • • • • . • • • « • • 

CCCTCCATCXX^GACATTOATGXXjGACATT^ 
. 1400 . 1410 1420 1430 1440 * 1450 1460 

880 690 900 910 920 930 

apute *-AAOTX3AO-CACAQ05CCTOqAfl^ - -03TTQ-QACT 

-.lit t.s t s.t :t is., sjst.'t« s .it J. fist ..z . . tits ii ts sis. 
TAAGGGGGTCCCTGTGC -TGATCCTOCTQACCTCACTGTC - CTOTXIAAGTAACCCCTXXTXWCrGTGACA 
1470 1480 1490* 1500 1510 1520 1530 

940 9S0 960 970 '980 
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inputs OQQQTT QC T Q QCTCGGC TGOTCCT - -CTTGTACCACCOC COO OGCAAO-GC- 

: tti x:i is: ji ll 8 **' l! 

CCTOOTGCaQOCCTGGCCCTCAC7XAAGA<XAC^ 
1S40 1S50 1560 1570 1580 1590 1600 

990 1000 1010 1020 
inputs CCTGGAGG-- -AGC-CAG CCAAT- -GATATC AAGGAGG- -ATGCCAT 



CCTGG AAGCACAGCGCTGAGCATGGGGCGCTCCCACTCAG AACTCTCC AGGG AGGCG ATG CCAGCCTTGG 
1610 1620 1630 1640 1650 1660 1670 

1030 1040 1050 
inputs TGCTCCCCGG A - - - CCCTGC - CCTGG C - CCAAG AG CTCAG 

GGGGTGGGGGCTGTCCTGCTCACCTGTGTGCCCAGCACCTGGAGGGGCACCAGGTGGAGGGTTTGCACTC 
1680 1690 1700 1710 1720 1730 1740 

1060 1070 1080 1090 
i npu t s - ACAC AATCTCC AAGAATG - - - - GGACCCT TT CCTCTGT 



CACACATCITTCTTGAATGAATGAAAGAATAAGTGAG 
17S0 1760 1770 1780 1790 1800 1810 

1100 1110 1120 
i npu t s CACC TCCGCACG AGCC CT- CCGG CCA - - CCC - C ATGGCC - - C 



CTCCCACTCCCTTTCCAACCTCACTTCCCGTAGCTGCCAGTATGTT^ 
1820 1830 1840 1850 1860 1670 1880 

1130 1140 11S0 1160 1170 
inputs TCCCACGCCTGGTGCATTGACCC CCACGCCCAGTCTATC-CAGCCAGGC- 

TCCCACTCCTGCTGCACAGGCCCTGGGGAGCTTTTGCCCACACACTTTCCATCTCTGCCTGTCAATATCG 
1890 1900 1910 1920 1930 1940 1950 

1180 1190 1200 1210 1220 1230 

inputs - - CCTGCCCTCACCAAGACATGCCCACGACAG ATGGGG - -CCC- -ACCCTCAACCAATATCCCCCATCCC 

TACCTGTCC - CTCCAGGCCGATCTCAAATCACAAGGATTTCTC 
1960 1970 1980 1990 2000 2010 2020 

1240 1250 1260 1270 1280 
inputs TGGTGG GGTTT TTTCCTTTGGCTT TGAGCCGCATGG GT- -GCTGNGC 

TGG AAAC AATCCTGTTACTCTGTCCCACGTCCAATC CG AGTGC 

2030 2040 2050 2060 2070 2080 2090 

1290 1300 1310 1320 1330 

inputs CTGTG ATGGNGC - -CTGC-CCA-GAGTCAAG- -CTGGCTCTC-TGG- - TATGATG ACCC C 

:-: s s • I :::: ::: ::: s . : : : • • ' s : 
TCTCACTGTATTAGAGCGCCAGCTCCTTGGGGCAGGGCCT^ 

2100 2110 2120 2130 2140 2150 2160 

1340 1350 1360 1370 1380 

inputs AC CACTCAT TGG CT AAAG - - GATTTG<X3GTCTCTCCXTCCTATAAGGGT 

i : : : ::: t s : « ♦ - • - j • 

CCTAGTAGCTGGCGCCCATCXrrAGTGGGCACTTAAXXrrTA^ 

2170 2180 2190 2200 2210 2220 2230 

1390 1400 1410 1420 1430 

inputs --CAC--CTCTAG-CAC AGA-OOCCTGAGTCATGOaAAA<^OTCACACTCCTGACCC TTAO 

: : x:::.i ::i : . t tt.tt .ti.t.M st tit iS ' 

TT^XCTTCTCIGGTCTCCTTQAO^ -CCAAACCCACOTATTCA 
2240 2250 2260 2270 2260 2290 2300 

1440 14S0 1460 *470 1**0 1490 1500 

inputs TACTCTQCCXXXSlC L n xri^ 

t.s. ss t s»* .t:. i t- t is •x.it i » « < ' ln« « * • * • * 1 

TTCAOTOAJOTTAJUVCAOQAATTOATTTAJJIOTOAA^ - CAOATOOTCTOA 

2310 2320 2330 t 2340 2350 2360 '2370 
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1510 1520 1530 1540 1550 1560 

inputs AG G AG AAG AGG AAGT GG ATCTGG AATTGGG AGG AGCCTCCACCCACCCCTG ACTCCTC 

GTTCTTGTGTCCTGGTAATTCCTCTCCAGGCCAGAATAATTGGCATGTCTCCTCAACCCACATGGG 

2380 2390 .2400 2410 2420 2430 2440 

1570 1580 1590 1600 1610 1620 
inputs CTTATG AAGCCAGCTGCTG AAATTAGCT ACTCA - - CCAAG AG TG AGGGGCA - G AGACTTC 



CTGGTTGTTCCTGCATCCCGATACCTCAGCCCTGGCCCTGCCCAGCCCATTTGGGCTCTGGTTTTCTGGT 
2450 2460 2470 2480 2490 2500 2510 

1630 1640 1650 1660 1670 

inputs CAGTCACTG AGTC - - TCCCA - GGCCCCCTT - - - GATCTGTACCCCACCCCTATCTAACAC 

GGGGCTGTC-CTGCTGCCCTCCCACAGCCTCCTTCTGTTTGTCGAGCATTTCTTCTACTCTTGAGAGCTC 
2520 2530 2540 2550 2560 25*70 2580 

1680 1690 1700 1710 1720 1730 

inputs - - -CACCCTT- -GGCTCCCA CTCCAGCTCCCTGTATTGATATAACCTGTCAG - - GCTGGCTTGGTT 

AGGCAGCGTTAGGGCTGCTTAGGTCTCATGGACCAGTGGCTC 

2590 2600 2610 2620 2630 2640 26S0 

1740 1750 1760 1770 1780 1790 

inputs AGGTTTTACTGGG -GCAGAGGATAGGGAATC -TCTTATTAAAACTAAC- ATGAAATATGTGTTGT 

TATCTTTTCTGGATGATCAGAAAAATAATTCCAT^^ 

2660 2670 2680 2690 2700 2710 2720 

1800 1810 1820 1830 1840 1850 1860 

.nputs TTTCATTTGC AAATTTAAATAAAG - - - ATACATAATGTTTGTATGAGATAAGAAAAAAAAAAAAAAAGGG 

: * : . ■ 

ATATTTTTATATATATTGTTAAATCCTTTGCTTCAT - TCCAAATGCTTTC AGTAATAAT AAAATTGTGGG 
2730 2740 2750 2760 2770 2780 2790 

nputs CGGCCGC 
TGG 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> mT258 a. a. 394 aa vs. 

> SwissProt Q99795 - (untitled) 319 aa 
scoring matrix: paml20.mat, gap penalties: -12/ -4 

23.0% identity; Global alignment score: -149 

10 20 30 40 50 60 70 

inputs MILQAGTPETSLLRVIjFLGLSTLAAFSRA^^ 

: . . :: : : : . : : • " - - - - - 

MV GKMWPVLW TLCAVRVTV^AISV^TPQDvXRASQGKSVTLPCTYHTSTSSREGLIQWD 

10 20 30 40 50 

80 90 100 110 120 130 140 

inputs PILIWFLE0EGKEPNQVX,SYINGVT4TNKPGTALVTiSISSR 



KLLLTKTERWIWPFSNKNYIHGELYKTJR-VSISNNAEQSDASITIDQLTMADNGTYECSVSLMSDLE- - 
60 70 80 90 100 110 120 

150 160 ;70 180 190 200 210 

inputs IGHSIKSIELKv1jVPPAPPSC$LG*JvpVvGTNVTL^ 

: . . . : : : . . . : . : . : . : . : . : . : : - : : : : - • : : • • * : 

-gntksrvrllvlvppskpecgiegetiignniqltcqskegsptpqyswkrynilnqe- -qplaqpasg 

130 140 150 160 170 180 190 

220 230 240 250 260 270 

inputs -SLKLTNLSIAMSGVYVCKAQNRVGFAKChTSnXDVTITGS -KAAW 

QPVS LKN I STDTS G Y Y I CTS S N EEGTQ FCN I TVA VRS P SMNVAL YVG I AVG WAAL 1 1 IG- - III YCCCC 
200 210 220 230 240 250 260 

280 290 300 310 320 330 340 

nputs KTLEELANDIKEDAIAPRTLPWTKGSDTISKNGTLSSVTSARALRPPKAAPPRPGTFTPTPSVSSQALSS 



RGKDDNTED- KEDA RPNREAYEEPPEQ LRELSR 

270 280 290 

350 360 370 380 390 

nputs PRLPRVDEPPPQAVSLTPGGVSSSALSRMGAVPVMVPAQSQAGSLV 



EREE - - EDD YRQEEQRSTGRES PDHLDQ - 
300 310 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> OT2S8 n.a. 1646 aa vs. 

> GenBank U79725 - Human A3 3 antigen precursor mR 2793 aa 
scoring matrix: pami20.mat, gap penalties: -12/-4 

40.0% identity; Global alignment score: 908 

10 20 30 
inputs GTCGACCC ACGC-GTC CG - -GTGCAC - - ATT - C - - GGGTTGCCGCC 

- - CTACCCCTTTGTG AGC AGTCTAGG ACTTTGTACACCTGTTAAGTAGGG AG AAGG C AGGGG AGGTGGCT 
10 20 ' 30 40 50 60 

40 SO 60 70 

inputs G CT CACC - CAC AACACCTGTAG AC AC - CGTGTGT 

GGTTT AAG GGG AACTTG AG GG AAG T AGGG AAG ACTCCTCTTGGG ACCTTTGG AG T AGG TG AC AC ATG AG C 
70 80 90 100 110 120 130 

80 90 100 110 

inputs CCAAC TCTCC - CTGAGTA - CTC CGGGCCA AGG - AGGG CC ATG AT 

CX^AGCCCCAGCTCACCTGCCAATCCAGCTGAGGAGCTCACCTGCCA^ 
140 ISO 160 170 180 190 200 

120 130 140 150 160 

inputs TCTTCAG GCTCG AACCCCCG A - - -GACCAG-C- - - TTGCTGCGGGTT - TTGTTCCTG 

GGGTG AG AAG AGGG AAAATTG C AGGG ACCTCCAGTTGGGCC AGGCC AG AAG C TG CTGT AG CTTT AACC AG 
210 220 230 240 250 260 270 

170 180 190 200 210 
inputs G - G ACTG AGTACCCTTGCTGCCTTCTCCCG AGC7CAG ATGGAGTT GCA CGTGCCC- - 

AC AGCTCAG A - - CCTGTCTGG AGGCTGCCAGTGACAGGTTAGGTTTAGGGC AGAG AAG AAGCAAG ACCAT 
280 290 300 310 320 330 340 

220 230 240 250 

inputs CC GGG C - CTC AA- -CAAATTGGAAG - CGGTAGAGGGAGAAGAAGTG 



C^TGGGGAAGATGTGGCCTGTGTTGTGGA(^CTCTGTCCAGTCAGG 
3S0 360 370 380 390 400 410 

260 270 280 290 300 
inputs GTGCTCCCCGCCTG - -GTACA-CGA TGGCACGGG AGG AGT CGTGGTCC 



G AA - ACTCCGCAGGACX3TTCTTCGGGCTCCGCAGGG AAAGAGTGTCACCCTC^ 

420 430 440 450 460 470 480 

310 320 330 340 3S0 

inputs - -CACC-CC- - -OK^AGGTGCCCATCCT GATCTGGTTCT TGOAACAAGAAGGGAAGGAA 



TCCACCTCX^GTCGAGAGGGACTTATTCAATX^ATAAGCTCC^ 

490 500 510 S20 S30 540 S50 

360 370 380 390 400 

inputs CCAAACCAGGTGrilTl'CriA CATTAATGGAGTCATGACAAATAAACCTG 

:...i:. : j :.:::...::.::. x . r : . ..:-s:-« 

TCTCGCCGTITTCAAACAAAAACTACATCCATGG^ 

S60 570 580 590 600 610 620 

410 . 420 430* 440 450 

inputs GAACAGCCCTGGTCCAC- -TCT ATCT CTTCA0Q0AATGTX3TC-CCTG0G 

ii - ii- in. i.i .i ii i s » »{. » 

TGCTGAOOIGTCXX^TGCCTCCATCACC^ 

630 640 . 650 660 670 680 . 690 

460 " 470 480 490 500 f 

inputs -C CTGGGGGCACTCCAGGAGGGAGACTCTSGGAC TTACOGCTGTTCTXnCAATGTGC - - - 

: :s:. i tt .::.:::::: . - : . : : : t:: . t::: 
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TCTGTCTCGCTG ATGTC AG A.CCTGG AGGGCAACR-CCAACTrCACGTCTCCGCCTC TTGGTCCTCGTGCCAC 
700 710 720 730 740 750 760 

S10 520 530 540 5S0 

inputs AG AATG ATG AAGGCAA - - AAGT AT AGG CCACA GCATCAAAAGCATA - -GAGCT- -CAA- - 

: . . : : . . : : : : . : . : : : : : . : : : : : . . : : : : ' • = 

CCTCCAAACCAG AATG CGG CAT CGAGGG AG AG ACCAT AATTGGG AACAACATC CAG CTG ACCTG CCAATC 
770 780 790 800 810 820 830 

560 570 S80 590 600 610 620 
inputs AGTGCTXjGTTCCTCCAGCTCCTCCATCCTGTAGTTTACAGGGTCTAC - - CCTATGTCGGG A CCAAT 

AAAGG AGGGCTC ACCAACCCCTCAGTACAGCTGGAAG - AGGTACAACATCCTG AATCAGG AGC AGCCCCT 
840 950 860 870 880 890 900 

630 640 6S0 660 
inputs GT GACC CTGAACTGCAAGTCCCCAAGGAGTAAA - - -CC TACTGC-TC 



GGCCCAGCCAGCCTCAGGTCAGCCTGTCTCCCTGAAGAATATCTCCACAGACACATCGGGTTACTACATC 
910 920 930 940 950 960 970 

670 680 690 700 710 
inputs AGTACCA GTGGGAGAG- -GCTG GCCCCATC-CT CC- -CAGGTCT TCTTTGG 

TGTACCTCC AGC AATG AGG AGGGG ACGC AGTTCTGC AAC ATCACGGTGGCCGTC AG ATCTCCCTCC ATG A 
980 990 1000 1010 1020 1030 1040 

720 730 740 750 760 
inputs AC - CAG CCTT AG ATG CTGTTCGTGGATCTTTAAAGC TCACTAACCTT TC- -CAT 

ACGTGGCCCTGTATGTGGGCATCXjCGGTGGGCGTGGTTGCAGCCCTCATTATCATTGGCATCATCATCTA 
1050 1060 1070 1080 1090 1100 1110 

770 780 790 800 

inputs TGCCATG TCTGGAGTCTATGT - - CTGCAAGGCTCAAAACAG AGTGG 

ctgctgctgctcccgagggaaggacgacaacactc 

1120 1130 1140 1150 1160 1170 .1180 

810 620 830 840 
inputs GCTTTG CCA- -AGTGCAAC GTGACCTT-- - - GG ACGTG ATG - -AC AGG- - 



cctatcaggagccaccagagcagctaagagaactttccagagagagggaggaggaggatcactacaggca 

1190 1200 1210 1220 1230 1240 1250 

850 860 870 880 

inputs GTCCAAGGCTGCAGTGGTCG - CTGG- -AGCAGTTGTGGG 

: . ' s : : 

AG AAG AGC AG AGG AGCACTGGG CX3TG AATCCCCGG ACCACCTCX3 ACCAGTG ACAGGCCAG C AGCAG AGGG 
1260 1270 1280 1290 1300 1310 1320 

890 900 910 920 

Inputs CA-CTTTTGTTGGGTTGGTG CTGATAGCTGGGCT GGTCCTGTT-- 



CGG CGG AGG AAGGGTT AGGGGTTCATTCTCCCGCTT<XrTGG C<nCCCTTCTCCTTTCTAAGCXX!riXriTCT 
1330 1340 13S0 1360 1370 1380 1390 

930 940 

nputs GTACCAO OGCC GG AOCAAGAC 

: : : 

CCTGTCCCTCCATCCCAGACATTOATGGGGACATTTCTT^ 

1400 1410 1420 1430 1440 1450 1460 

9S0 960 970 980 

nputs CTTGGAA QAGCTGG - CCAA-TGA TATCAAO -GAAGATGCC ATT 

: i.::n. it., sis • ' x*xt...tt . t 

CCTX3GTAAGGGGGTCCCIGTGCTGATCCI^ 

1470 1480 14*90 1500 1510 1520 . . 1530 

990 1000 1010 1020 
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inputs GCTCCC OGGACCTTGCCTT GGACCAA AGCCTC AGACACAA 

■ • * • • ::: : ti : : : . x . » i . : 

ACACCTGGTGCGGG CCTGGCCCTCACTCAAG ACCAGGCTXK^^CCTCCACTTCCCTCGTAGTTGGCAGG A 
1540 1550 1560 1570 1S80 1590 1600 

1030 1040 1050 1060 1070 10B0 
inputs TCTCCAAG AATGG - G ACACTTT - CTTCGGTCAC CTCAGCAC - G AGCTCT GCG - -GCCACCCA 

GCTCCTGGAAGCACAGCGCTGAGCATGGGGCGCTCCCACTCAGAACTCTCCAGGGAGGCGATGCC^ 

1610 1620 1630 1640 1650 1660 1670 

1090 1100 1110 
inputs AGG --- -CTGCTC- -CT CCAAGACCTGG CAC - -ATTTACT 



TGGGGGGTGGGGGCTGTCCTGCTCACCTGTGTGCCCAGCACCTGGAGGGGCACCAGGTGGAGGGTTTGCA 
1680 1690 1700 1710 1720 1730 1740 

1120 1130 1140 1150 
i npu t s C - CCACAC - -C---C AGTGT CTCTAGCCAGGCCCTGTCCT CAC 

CTCCACACATCTTTCTTGAATGAATGAAAGAATAAGTGAGTATGCTTGGGCCCTGCATTGGCCTTGGCCTC 
1750 1760 1770 1780 1790 1800 1810 

1160 1170 1180 1190 1200 1210 

inputs CAAGACT- - - GCCCAGGGTAG ATGAACC - CCCACCTCAGGCAGT - -GTCCCTGACCC- -CAGGTGGGGTT 



CAGCH'CCCACTCCCITTCCAACCTCACTTCCCGTAGCrGCCAGTATGTTCCAAACCCTCCTGGGAAGGCC 
1820 1830 1840 18S0 1860 1870 1880 

1220 1230 1240 1250 
. npu t s TCTTC TTCTG CTCTG AG CC GCATGGG TGCTGTGCCTGT-GATG 

ACCrCCCACrCCTGCTGCACAGGCCCTGGGGAGCTTTTGCCCACACACrrTCCATCTCTGC 

1890 1900 1910 1920 1930 1940 1950 

1260 1270 1280 1290 1300 1310 

nputs - -GTGCCTG - - - CAC AG AGTCAGGCT - GGGTCTCTTGTGTG A - - - TAGCCCAGGC ACTCATTAGCTACAT 



TCGTACCTGTCCCTCCAGGCCCATCTC AAATCACAAGGATTTCTCTAACCCTATC - CTAATTGTCCACAT 
1960 1970 1980 1990 2000 2010 2020 

1320 1330 1340 13S0 1360 1370 

nput S -C-TGGTATCTGACCT- - TTCTGT AAAGGTC - TCCTT - - GTGGCACAGAGG ACTCAATCTT - -GGGAGGA 
: :.. . : .t:: :.: 

ACGTGGAAACAATCCItrrTACTCrGTCCCACGTCaVATC^ 

2030 2040 2050 2060 2070 2080 2090 

1380 1390 1400 1410 1420 
nputs TGCCCACA TTCTAG ACCTCCAG -TCCTTTG - -CT CCTA- -CCTC CTT CTAT- r -TGT 

TGCTCTCACTGTATTAGAGCGCCAGCTX^rTGGGGCAG 

2100 2110 2120 2130 2140 2150 2160 

1430 1440 1450 

nputs TG G AATACTGG - GCC - - TC- -AGTAAG -ACTAAA ATCTG 

-* : : : it tn..i : . : : : 

AGCCCTAQTAGCItSGCGC<?CATCCTAGTGGGCACTTAAGCTTiU^ 

2170 2180 2190 2200 2210 2220 2230 

1460 1470 1480 

nputs GGTCA AAGOACAAAAGGAGOAAAT GGACC 

...tit: s.s:: ..si: 

GCCTTCCCTTCTCTGGTCTXXrrTGAGATGAT 

2240 2250 2260 2270 2280 2290 2300 

1490 1500 _ X510 1520 1530 

aputs - -TGAGGTAGG - - -GGGTTGGGAGTGAGGAGGCT-TCACTT CCTCCCTOCT; TCT- 

n :: s:.. . ..is ...t. t i t ::*.: ss: 

CATTCAGTGAGTTAAACACGAATTQATTTAAAGTOAACACACACAAGGQAGCTT^ 

2310 2320 2330 * 2340 23S0 2360 2370 
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1540 1550 1560 1570 1*580 
i npu t S - - - CCCTG AAGCCAG ATG AATG CT GC - -GGAAGATCGGCT ACCCTCCAAGGGCT 

■ • ■ ........ mm mm . • . » « • • • • 

• .... .......... . . 2 ; 2 . ...... ..... 

AGTTCnTGTGTCCTGGTAATTCCTCTCCAGGCCAGAATA 

2380 2390 2400 2410 2420 2430 2440 

1590 1600 1610 1620 1630 1640 

inputs C -TGGAGGAGACTGCCAGTCAGTGATGC- -CCCTGGCTCTG TGATCTGTAC AACACCC - TTATCTAA 

CCTGGTTGTTCCTGCATCCCGATACCTCAGCCCTGGCCCTGCCCAGCCCATTTGGGCTCTGGTTTTCT 
2450 2460 2470 2480 2490 2500 2510 

1650 1660 1670 1680 1690 
inputs TG-- - CTGTCCTT - TGCCGTTCGCTCCATCTCC - -CTGT ATTAATATAAC- - 

.. ....... •■•« .... •••• ... • .. 

. . ....... .... .. ... . .... .... .... ■ • ... 

TGGGGCTGTCCTGCTGCC-CTCCCACAGCCTCCTTCTGTTTGTCGAGCATTTCTTCTACTCTTGAGAGCT 
2S20 2530 2540 2550 2S60 2570 2580 

1700 1710 1720 
i np u t s CTGTC CTGCT GGCT TGGCTGG GTTT - - TGTTG 

CAGGCAGCGTTAGGGCTGCTTAGGTCTCATGGACCAGTGGCTGGTCTCACCCAACTGCAGTTTACTATTG 
2590 2600 2610 2620 2630 2640 2650 

1730 1740 1750 1760 1770 

i npu t s TAGCAGGGGG ATAGG AAAG ACATTT - - TAAAATCTG ACTTG AAATTG ATGTTTTTGTT 

CTATCTTTTCTGGATGATCAGAAAAATAATTCCATAAAT 

2660 2670 2680 2690 2700 2710 2720 

1780 1790 1800 1810 1820 1830 1840 

inputs TTTATTTTGCAAATTTCAATAAAGA TACATCG - - -CATTTGCATGGAAAAAAAAAAAAAAAGGGCG 

..i. :i. ".*«*!• ;..2I»22»2222««J * z 

TATATTTTTATATATATTGTTAAATCCTTC^ 

2730- 2740 2750 2760 2770 2780 2790 



inputs GCCGC 
T--GG 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPleaee cite: Myers and Miller , CABIOS (1989) 

> hT258 n.a. 1869 aa vs. 

> pec am n.a. 2557 aa 
scoring matrix: paml20.mat, gap penalties: -12/-4 

40.5% identity; Global alignment score: 1546 

10 20 . 

inputs G TC GACC CAC GCGTNCNTC-CAGCGTN- 



GAATTCCGGGAGAAGTGACCAGAGCAATTTCTGCTTTTCACAGGGCGGGTTTCTCAACGGTGACTTGTGG 
10 20 30 40 50 60 . 70 

30 40 50 60 70 80 
inputs -CGGAGCCGCC — CTGGG — TGTCAGCGGCTCGGCTCCCGCGCACGCT CCGGC CGTCGC 

..... . • . • . • . « • . • ■ • a * • • • • * ? * • 

• •■.•a* • ■.*•• • » • • • a • • ... . . • • . .. .» • 

GCAGTGCCTTCTGCTGAGCGAGTCAT-GGCCCGAAGGCAGAACTAACTGTGCCTGCAGTCTTCACTCTCA 
80 90 100 110 120 130 

90 100 110 120 130 

inputs GCAGCCTCGGCA — CCTGCAGGTCCG TGCGTCCCG CGGCTGGCGCCCCTGACTCCGTC 

GGATGCAGCCGAGGTGGGCCCAAGGGGCCACGATGTGGCTTGGAGTCCTGCTGACCCTTCTG-CTCTGTT 
140 150 160 170 180 190 200 

140 150 160 170 180 
inputs CCGGCCAGGGAGGGCCATGATTTCCCT — CCCGG — GGCC CCTGGTGA— CCAAC T 



CAAGCCTTG— AGGGTCAAGAAAACTCTTTCACAATCAACAGTGTTGACATGAAGAGCCTGCCGGACTGGA 
210 220 230 240 250 260 270 

190 200 210 220 230 
inputs TGNTGCGGTTT TTGTTCCTGGGGCTG-AGTGC — C-C TC-GCGCCCCC-CTCGCG GGCC 



CGGTGCAAAATGGGAAGAACCTGACCCTGCAGTGCTTCGCGGATGTCAGCACCACCTCTCACGTCAAGCC 
280 290 300 310 320 330 340 

240 250 260 270 280 

inputs -CAGCTGCAACTGC ACTTGC C CGCCAACCGGTTGCAGGCGGTG 

•••• . . . a • m a • . . . • . «Im2«: 

<..... ... ... ••••• . • ... . .... • ...... 

TCAGCACCAGATGCTGTTCTATAAGGATGACGTGCTGTTTTACAACATCTCCTCCAT 

350 360 370 380 390 400 410 

290 300 310 320 330 340 

inputs GAGGAGGGGGA AAGT — GGTGCTTCAGCA-TGGTACACCT TGCACAGGGAGGTGTCTTCATC 

: :::: : : . ::. .:: ::: ; xx n 

AGTTATTTTATTCCTGAAGTCCGGATCTATGACTCAGGGAC^TATAAATC 
420 430 440 450 460 470 480 

350 360 370 380 390 

inputs CCAG CCA-TGGGAGGTGCC — CTTT — GTGATGTGGTTCTTCAAACAGAAAGAAAAGGAGGATC 

: : : :: :: : :. : :..:..: • 
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AAAGACAAAACCACTGCAGAGTACCAGCTGTTGGTGGAAGGAGTGCCC^^ 
<90 500 510 520 530 540 550 

400 410 420 430 440 450 

inputs AGGTGTTGTCCTACATCAATGGGGTCA CAACAAG-CAAACCTGGAGTAT CCTTGG-T 

::....:::..:<...::::.::. 

AGAAAGAGGCCATCCAAGGTGGGATCGTGAGGGTCAACTGTTCTGTCCCAGAGGAAAAGGCCCCAATACA 
560 570 580 590 ^600 610 620 

460 470 480 490 500 
inputs CTACTC CATGCCCTCCCGGAACC — TGTCCCTGC-GGGTGGAGGG TCTC 



CTTCACAATTGAAAAACTTGAACTAAATGAAAAAATGGTCAAGCTGAAAAGAGAGAAGAATTCTCGAGAC 
630 640 650 660 670 680 690 

510 520 530 540 550 
inputs CAGGAGAAAG — ACTCTGG CCCCTAC AG CTGCTC CG TG AATG TG C AAGACAAACAA 



CAGAATTTTGTGATACTGGAATTCCCCGTTGAGGAACAGGACCGCGTTTTATCCTTCCGATGTCAAGCTA 
700 710 720 730 740 750 760 

560 570 580 590 
inputs GG — CAAATCTAGGGGCCA CAG-CATCAAAA CCTTA GAACTCAATG 



GGATCATTTCTGGGATCCATATGCAGACCTCAGAATCTACCAAGAGTGAACTGGTCACCGTGACGGAATC 
770 780 790 800 810 820 830 

600 610 620 630 

inputs -TACT GGTTCCTC — CAGCTCCTCC ATCCTG C-CGTCTCCA — GGGTG 

: - : : ::::: 
CTTCTCTACACCCAAGTTCCACATCAGCCCCACCGGAATGATCATGGAAGGAGCTCAGCTCCACATTAAG 
840 850 860 870 880 890 900 

640 650 660 670 680 690 
inputs TGCCCCATG -TGGGGGC AAACGTGACCCTG-AGCTGCCAG TC TC CAAGGAGTAAG 



TGCACCATTCAAGTGACTCACCTGGCCCAGGAGTTTCCAGAAATCATAATTCAGAAGGACAAGGCGATTG 
910 920 930 940 950 960 970 

700 710 720 

Inputs CCC GCTGT C CAATACCAGTG-GGATC -GGCAGCTT 

::s ::::: : : : . : ::: : 

TGGCCGACAACAGACATGGCAAGAAGGCTGTGTACTCAGTCATGGC 
980 990 1000 1010 1020 1030 1040 

730 740 750 760 770 

Inputs C-CATCCT TCCAGAC TTTCTTTG — CACCAGCATTAGATGTCATCCGTG — GGTCTTTA 

: : • : : • t :.:t x: :::::: * # t • 

CACGTCOVAAGTGGAGTCX^GCCGCATATC^ 

1050 1060 1070 1080 1090 1100 1110 

780 790 800 810 820 830 
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inputs AGCCTCACC-AACCTTTCGTCTTCCAT -GCCTGGA -GTCTATGTCTG — CAAGGCC — CACA 

. s:« .:: x : . ssss: :: : :: :.: 

TTCCAAGCCCGAACTCKSAATCTTCCTTCACACATCTGGACCAAGGTGAAAGACTCAAC^ 
1120 1130 1140 1150 1160 1170 1180 



840 850 860 870 
inputs ATG — AGGTG GGC — ACTGOCCA ATGTAA TGTGACGCTGG AAGTGA 



ATCCCAGGAGCACCTCCAGCCAACTTCACCATCCAGAAGGAAGATACGATTGTGTCACAGACTCAAGATT 
1190 1200 1210 1220 1230 1240 1250 

880 890 900 910 920 930 

inputs GCAC AGGGCCT-GGAG-CTG-CAGTGGTTGCTGAAGCTGT — TGTGGGTACC--CTGGTTGGACTG 



TCACCAAGATAGCCTCAAAGTCGGACAGTGGGACGTATATCTGCACTGCAGGTATTGACAAAGTGGTCAA 
1260 1270 1280 1290 1300 1310 1320 

940 950 960 970 980 
inputs GGGTTGCTG-GCTGGGCTGGTCCTCTTGTA C-CACC — GCCGGGG CAAG GCCCTG 



GAAAAGCAACACAGTCCAGATAGTCGTATGTGAAATGCTCTCCCAGCCCAGGATTTCTTATGATGCCCAG 
1330 1340 1350 1360 1370 1380 1390 



990 1000 1010 1020 1030 1040 
inputs GAGGAGCCAGCCAATGATATCAAG — GAGGATG-CCATTGCTCCCCGGACC CT — GCC C 



TTTGAG-- GTCATAAAAGGACAGACCATCGAAGTCCGTTGCGAATCGATCAGTGGAACTTTGCCTATTTC 
1400 1410 1420 1430 1440 1450 1460 



1050 1060 1070 1080 1090 
inpu t S TGGCC — CAA — GAGCTCAGACACAATCTCCAAGAATGGGACCCTTTCCTCTGTCA — CCTCCG C 

a mm » • ■ • • • •••*«••«• • • • • • * 

i • • • • • • • • • • • • • • « • • • • •••••••• ■ • • • • •••••• - • • • • « • 

TTACCAACTTTTAAAAACAAGTAAAGTTTTGGAGAATAGTACCAAGAACTCAAATGATCCTGCGGTATTC 
1470 1480 1490 1500 1510 1520 1530 

1100 1110 1120 1130 
inputs A CGAGCCCTCCG— GCCACC — ; CCA TGGC CC — TCCCAGGCCT 



AAAGACAACCCCACTGAAGACGTCGAATACCAGTGTGTTGCAGATAATTGCCATTCCCATGCCAAAATGT 
1540 1550 1560 1570 1580 1590 1600 

1140 1150 1160 1170 

inputs GGTGCATTG ACCCCC ACGCCCAG TCTATCCAGCCAGG 

::: : . : s : xzit :::in:.s 

TAAGTGAGGTTCTGAGGGTGAAGGTGATAGCCCCGGTGGATGAGGTCCAGATTTCTATCCTGTCAAGTAA 
1610 1620 1630 1640 1650 1660 1670 

1180 

Lnputs C -C CTGC -CC — TCACCAAG 

: : : : : : :s 

GGTGGTGGAGTCTGGAGAGGACATTGTGCTGCAATGTGCTGTGAATGAAGGATCTGGTCCCATCA 
1680 1690 1700 1710 1720 1730 1740 
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1190 1200 1210 1220 1230 1240 

inputs ACATGC — CCACGAC^GATGGGGCCCACCCT-CAACCAA-TATCX:CC CATCCCTGG TGGGGT 

: : • :: :.: ::::: : . : ::: : : : :;. 

AAGTTTTAC^GAGAAAAAGAGGGCAAACCCTTCTATC^AATGACCTCAAATGCCACCCAGGCATTTT<^A 
1750 1760 1770 1780 1790 1800 1810 

1250 1260 1270 1280 1290 1300 
inputs TTTTTCCTTTGGCTT TGAGCCGCATGGGTG CTG NGCCTGTGATGGNGCCTGCC 



CCAAGCAGAAGGCTAGCAAGGAACAGGAGGGAGAGTATTACTGCACAGCCTTCAACAGAGCCAACCACGC 
1820 1830 1840 1850 * 1860 1870 1880 

1310 1320 1330 1340 1350 

inputs CAGAGTC AAGCTGGCTCTCTG-GT-ATGATGACCCCACCACTCATTGG-CTAAAG 

••••••• ••••••••• •••• • m m . - _•••«« « * • * a • • • > ■ • « 

CTCCAGTGTCCCCAGAAGCAAAATACTGACAGTCAGAGTCATTCTTGCCCCATGGAAGAAAGGACTTATT 
1890 1900 1910 1920 1930 1940 1950 

1360 1370 1380 1390 
inputs G ATTTGGG -GTC-TC TCCTTCCTATAAGGGTCA CCTCTAGCA CAGAGG 



GCAGTGGTTATCATCGGAGTGATCATTGCTCTCTTGATCATTGCGGCCAAATGTTATTTTCTGAGGAAAG 
1960 1970 1980 1990 2000 2010 2020 

1400 1410 1420 1430 1440 1450 
inpu t s CCTGAGTCATG-GGAA AGAGTCACACTCCTGACCCTTAGTAC TCTG — CCCCCACCTC 



CCAAGGCCAAGCAGATGCCAGTGGAAATGTCCAGGCCAGCAGTACCACTTCTGAACTCCAACAACGAGAA 
2030 2040 2050 2060 2070 2080 2090 

1460 1470 1480 1490 1500 1510 

i npu t s TCTTTAC — TGTGGGAAAACCATC- -TCAGTAAGACCTAAGTGTCCAGG AG ACAG AAGGAG AA-GA 

AATGTCAGATCCCAATATGGAAGCTAACAGTCATTACGGTCACAATGAC — GATGTCAGAAACCATGCAA 
2100 2110 2120 2130 2140 2150 2160 

1520 1530 1540 1550 1560 1570 

inputs GGAAGTGGATCTGGAATTGGGAGGAGCCTCCACCCACC-CCTGAC — TCCTCC TTATGAAGCCAGC 

•:: . . • : . : : „: s: : : 

TGAAACCAATAAATGATAATAAAGAGCCTCTGAACTCAGACGTGCAGTACAOGGAAGTTCAAGTGTCCTC 
2170 2180 2190 2200 2210 2220 2230 

1580 1590 1600 1610 1620 1630 1640 

inputs TGCTGAAAOTAGCTACTCACCAAGAGTGAGGGGCAGAGACTTCCAGTCACT GCC — 

• # : •? - ti 

AGCTGAGTCTCACAAAGATCTAGGAAAGAAGGACACAGA^ACAGTGTACAGTGAAGTCOGGAAAGCTC 
2240 2250 2260 2270 2260 2290 2300 

1650 1660 1670 1680 1690 1700 

inputs CCCTTGATCTGTACCCCAC CCCTA — TCTAACACCACCCTTG — GCTCXXACTCCAGCTC—CCT 

:::: . : : : : : - : • : . i : ::. 

CCCTGATGCOGTGGAAAGC&GATACTCTAGAAOGGAAGGCTCCCTTGATGGAACTTA 
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2310 2320 2330 2340 2350 2360 2370 



1710 1720 1730 1740 1750 1760 
inputs GTATTGATATAACCTGTCAGG-CTGGCTTGGTTAG-GTTTTACTGGGG CAGAGGATAGGGA 



GA — TGCACATCCCTGGAAGGACATCCATGTTCCGAGAAGAACAGATAATCCCTGTATTTCAAGACCTCT 
2380 2390 2400 2410 2420 2430 



1770 1780 1790 1800 1810 1820 

inputs -ATCTCTTATTAAAA CTAACATGAAATATGTGTTGTTTTCATTT — GCAAATTTAAATAAAGATACA 



GTGCACTTATTTATGAACCTGCCCTGCTCCCACAGAACACAGCAATTCCTCAGGCTAAGCTGCCGGTTCT 
2440 2450 2460 2470 2480 2490 2500 



1830 1840 1850 1860 

inputs TAAT GTTTGTATGAGATAAGAAAAAAAAAAAAAAAGGGCGGCCGC- 



TAAATCCATCCTGCTAAGTTAATGTTGGGTAGAAAGAGATACAGAGGGG 
2510 2520 2530 2540 2550 
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TANGO 281 

Input file AthPb81dl0.seq; Output File AthPb81dlO .pat 
Sequence length 1812 



M R L 3 

GTCGACCCACGCGTCCGGCGGAGGTTGTGGCTGCACCGTGGTC ATG CGT CTG 73 



F V 
TTT GTC 
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VP* 246 
GTT CCC TGA 802 

GCCTG<XXXCAGCCTCTTTGGCT 881 

TGTGGTGCGTGTGCCTTGTCTAGACATG^^ 96° 

GGGACCAATCTGTTTTCTTCCTCACTTGAAATTGT^ 1039 

TATTTCXXXXTTTCACCXXAAG 1118 

GCTTTGGGGAGTAGtt^GCTAGCTGCTGCT^ "97 

GGGCXrrCAGTGGO^TGGGT^ 1276 

GTCTGGACGGTCCCACTGTCTT^^ 1355 

GGCCAGGGGCCTCTGTCTACTACACACTCTGGTTTCTCCCTACAGTC^^ 1434 

TTTTTCTATO^GATGT^ 1513 
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TCCGTTTCCCTTGACACCAGCTTCATAGAATACCTGACTCCTGTACTACAGTCCAGT^ 1592 

AGGGCCAGGGGTTATCTGGACCAAGGGTGGGGGTGGAGAGCCTGGATGGT^ 1671 

ATTCCCTGTTGGTTCCTGTTTCACTGGCTGTTTTAGTTTTGTG 1750 
TCGTTTATAATAAATGAATATTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 1812 
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2.7 7,i "' ' m 

| ' | • | • | i | • | i | i | • | i | i | i | i , 
1 41 81 121 161 201 241 



>hT281 

MRLFVRPSVRPAMAAPAPSPOTLSLLLLLLLPSPGAHGELCRPFGEDNSIPESCPDFCCG 

SCSSQYCCSDVLKKIQWNEEMCPEPESSRFSAHPETPEQLGSALKYQSSLDSDNMPGFGA 

TVAIGLTVFWFIATI IVCFTCSCXCLYKKCCRPRPWSNTTTTTVVHTAYPQPQPVAPS 

YPGPTYQGYHPMPPQPGMPAAPYPTQYPPPYLAQPTGPPAYHETLAGASQPPYNPAYMDP 
PKAVP 
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Alignments of top-scoring domains: 

*SBH: domain 1 of 1, from 97 to 146: score 5.5, E = 8.5 

*~>ktalgelLkPlnseyGKvaPgVK3ttplmgvfmalfavFLliileiYn 
♦ lg+ Lk s +Pg+G t+ +g + + «-f+vF+ i+ + 

hT281 97 PEQLGSALKYQSSLDSDNMPGFGATVAIG - -LTVFWFIATI I VCFT 141 

ssvll<-* 

• s 

hT281 142 CSCCC . 146 
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Input file T281Atmea4 9d3; Output File T28lAtmea49d3 .pat 
Sequence length 1858 

3TCGACCCACGCGTCCGCGCGGAGGTTC 7 9 

MAAPAPSLWTLLLLLLL 17 

7TGTCCCGCC ATG GCT GCG CCG GCG CCC TCT CTG TGG ACC CTA TTG CTG CTG CTG TTG CTG 140 

LPPPPGAHGE' LCRPFGEDNS 37 

!TG CCG CCG CCT CCG GGT GCC CAT GGT GAG CTG TGC AGG CCC TTT GGT GAA GAC AAT TCG 200 

IPVFCPDFCCGSCSNQYCCS 57 

»TC CCA GTG TTC TGT CCT GAT TTC TGT TGT GGT TCC TGT TCC AAC CAA TAC TGC TGC TCG 260 

DVLRKIQWNEEMCPEPESSR 7 7 

AC GTG CTG AGG AAA ATC CAG TGG AAT GAG GAA ATG TGT CCT GAG CCA GAG TCC AGC AGA 32 0 

fSTPAEETPEHLGSALKFRS 97 

TT TCC ACC CCC GCG GAG GAG ACA CCC GAA CAT CTG GGT TCA GCG CTG AAA TTT CGA TCC 38 0 

SFDSDPMSGFGATVAIGVT I 117 

3T TTT GAC AGT GAC CCT ATG TCA GGG TTC GGA GCG ACC GTC GCC ATT GGC GTG ACC ATC 44 0 

? VVFIATI IICFTCSCCCLY 137 

IT GTG GTG TTT ATT GCC ACT ATC ATC ATC TGC TTC ACC TGC TCC TGC TGC TGT CTG TAT 500 

<MCCPQRPVVTNTTTTTVVH 157 

VG ATG TGC TGC CCC CAA CGC CCT GTC GTG ACC AAC ACC ACA ACT ACT ACC GTG GTT CAT 560 

^PYPQPQ PQPVAPSYPGPTY 177 

:C CCT TAC CCT CAG CCT CAA CCT CAA CCT GTG GCC CCC AGC TAT CCT GGA CCA ACA TAC 620 

} G Y H PM P p PARNAS S TLPN A 197 

.G GGC TAC CAT CCC ATG CCC CCC CCA GCC AGG AAT GCC AGC AGC ACC CTA CCC AAC GCA 680 

PTTLPGPAHRAATLP * 214 

A CCC ACC ACC CTA CCT GGC CCA GCC CAC AGG GCC GCC ACC CTA CCA TGA 731 

"CCTTGGCTGGAGCCAGCCAGCCTCCATACAACCCGACCTACATGGATTCCCT 810 

GCCTCTTTGGCTGCC7VTTTATGTCGTGTGTGAGTGA 889 

TTXnXrTAGACATGTGGCTTCCTCTGCTGTTC 968 

TCTTCCTGACTTCAAATTGTACTTTCTGAAATT^ 1047 

CCXAAGGTGACXIAGCCATGGCCreTCATAC 1126 

^AGCTAGCIXXTTtXTTAGGCCTTTATTCC 1205 

rGACAAGGGGACTCAGTGGCAGGGGGTCAC^^^ 1284 
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ACTGTCCTTCCCGGGGCTGTATAGAGGGCCACATGTGTTCA^ 1363 

TGGCTTGGCTCCTGCTCCCAGGCCAGGGACCTCGGTCTGTCT 14 4 2 

rrGTTAGCCAAACATTTTGCCTGTTTTCTGTCTCCAGATC 1521 

\GGACAGACAACCTGACCTCCGACTGTCAGTTTCCCTTGACACCATCTTCAT^ 1600 

:CAGTTtGTCCCAGTAGCAGGGACACCAAGGCCAATGGGTTATCTGGACCAAAGG 1679 

rTCCGGCCCAGATGTGAATACCTCCATATTCCCTGTTGGTTCCTGTTTCACTGGCTC 1758 

STTTCTGAGCATTCAGACTCCGCACCCTCATTTCTAA 1837 

1858 

vAAAAAAAAAGGGCGGCCGC 
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m win r.its 




I i , , , • j i , . | • | i j i j i | « [ « 
1 41 81 121 161 201 



MAAPAPS LWTIXLLLLLLPP PPG AHGELCRPFG EDNS I P VFC PDFCCGSCSNQYCCS DVL 
KKIQWNEEMCPEPESSRFSTPAEETPEHLGSALKFRSSFDSDPMSGFGATVAIGVTIFW 
FIATIIICFTCSCCCLYKMCCPQRPVVTOTTTTTVVHAP 
HPMPPPAKNASSTLPNAVPTTLPGPAHRAATLP 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hT281 a. a. 245 aa vs. 

> raT281 a. a. 213 aa 
scoring matrix: paml20.mat, gap penalties; -12/-4 

66.5% identity; Global alignment score: 739 

10 20 30 40 50 60 70 

inputs* MRLFVTIPSWPAMAAPAPSPOTLSLLLLLLLPSPGAHGELC 

« •••••• *«• **:::: • *5-:": :;:::: :::•:::•: «-_««••«••■»•••••«• 

ii - AAPAPS LWTLLLLLLLLP P PPG AHGELCR P FG EDNS I PVFC PDFCCGSCSNQ YCCS D 

10 * 20 30 40 50 

80 90 100 HO 120 130 

inputs VXKKIQWNEEMCPEPESSRFSAHPE-TPEQIXSSALKYQSSLDSD^ 

VUUUQWE^ 

60 70 80 90 100 110 120 

140 150 160 170 180 190 200 

inputs FTCSCCCLYKMCCRPRPWSOTTTTTV^ 

••••••••••••• t«««»»jjj >4 jjjjjj ::.*...•••••*••••• 

raSCCCLYl^CCPQ^ PARN 

130 140 150 160 170 180 

210 220 230 240 

inputs PPPYLAQPTGPPAYHETLAGASQPPYNPAYMDPPKAVP 

AS STL - - PN AV PT TLPGPAHRA AT LP 

190 200 210 
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SEQUENCE LISTING 



<110> Millennium Pharmaceuticals, Inc. 

<120> SECRETED PROTEINS AND USES THEREOF 

<130> 7853-210-228 

<140> 
<141> 

<150> 09/336,536 
<151> 1999-06-18 

<160> 198 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 1338 

<212> DNA 

<213> Homo sapiens 

<400> 1 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 60 
agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 12 0 
gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180 
cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 240 
actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 300 
gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 3 60 
gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 420 
gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 4 80 
tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 54 0 
tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 600 
gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 660 
gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 720 
ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 780 
ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 84 0 
gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 
tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 
gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020 
gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 
gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 114 0 
ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 
ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 
gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320 
aaaaaaaagg gcggccgc 1338 

<210> 2 
<211> 728 
<212> DNA 

<213> Homo sapiens 
<400> 2 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 
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gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120 

catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180 

gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 24 0 

ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 300 

ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360 

cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 

gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480 

gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540 

ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 

ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 

tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 
tctttgct 728 



<210> 3 

<211> 243 

<212> PRT 

<213> Homo sapiens 



<400> 3 

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser 
15 10 15 

Pro Pro Leu Asp Asp Asn Lys lie Pro Ser Leu Cys Pro Gly His Pro 
20 25 30 



Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 
50 55 60 



Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala Gly 
85 90 95 



Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu 
100 105 110 

Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg Val 
115 120 125 

Leu Val Asn Glu Gin Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr 
130 135 140 

Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 150 155 160 

Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Glu Ser lie Ala 
165 170 175 

Ser Phe Phe Gin Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 
180 185 190 

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val Gin 
195 200 205 
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Val Gly Val Gly Asp Tyr He Gly He Tyr Ala Ser He Lys Thr Asp 

210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 

225 230 235 240 



Val Phe Ala 



<210> 4 
<211> 228 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Ser Pro Pro Leu Asp Asp Asn Lys He Pro Ser Leu Cys Pro Gly His 
1 5 10 15 

Pro Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro 
20 25 30 

Gly Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly 
35 40 45 

Glu Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp 
50 55 60 

Pro Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala 
65 70 75 80 

Gly Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser 
85 90 95 

Glu Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg 
100 105 110 

Val Leu Val Asn Glu Gin Gly His Tyr Asp Ala Val Thr Gly Lys Phe 
115 120 125 

Thr Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val 
130 135 140 

Tyr Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Glu Ser He 
145 150 155 160 

Ala Ser Phe Phe Gin Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu 
165 170 175 

Ser Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val 
180 185 190 

Gin Val Gly Val Gly Asp Tyr He Gly He Tyr Ala Ser He Lys Thr 
195 200 205 

Asp Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser 
210 215 220 
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Pro Val Phe Ala 
225 



<210> 5 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly 
15 10 15 



<210> 6 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro Gly Arg Asp Gly 
15 10 15 

Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu Lys Gly Glu 
20 25 30 

Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro Gly Pro Arg 
35 40 45 

Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala 
50 55 60 



<210> 7 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Ala Phe Ser Ala Lys Arg Ser Glu Ser Arg Val Pro Pro Pro Ser Asp 
1 5 10 15 

Ala Pro Leu Pro Phe Asp Arg Val Leu Val Asn Glu Gin Gly His Tyr 
20 25 30 

Asp Ala Val Thr Gly Lys Phe Thr Cys Gin Val Pro Gly Val Tyr Tyr 
35 40 45 

Phe Ala Val His Ala Thr Val Tyr Arg Ala Ser Leu Gin Phe Asp Leu 
50 55 60 

Val Lys Asn Gly Glu Ser lie Ala Ser Phe Phe Gin Phe Phe Gly Gly 
65 70 75 80 

Trp Pro Lys Pro Ala Ser Leu Ser Gly Gly Ala Met Val Arg Leu Glu 
85 90 95 

Pro Glu Asp Gin Val Trp Val Gin Val Gly Val Gly Asp Tyr He Gly 
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100 105 110 

lie Tyr Ala Ser He Lys Thr Asp Ser Thr Phe Ser Gly Phe Leu Val 
115 120 125 

<210> 8 

<211> 1263 

<212> DNA 

<213> Mus musculus 

<400> 8 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 
cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 120 
ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 
ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 
gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 300 
acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 360 
cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc gggcctgcgg 42 0 
gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag agccgggtac 4 80 
ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggccatt 540 
acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 
acgccactgt ctaccgggcc agcttgcagt ttgatcttgt caaaaacggg cagtccatcg 660 
cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 
tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 
gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 
ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 
agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 
ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 
gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 
agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 
ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 
ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 
cgc 1263 

<210> 9 
<211> 729 
<212> DNA 

<213> Mus musculus 
<400> 9 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 
gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 
catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 
gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240 
gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 
cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 
acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 420 
ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 480 
cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 540 
tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 
cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 
atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 
gtcttcgct 729 

<210> 10 
<211> 243 
<212> PRT 

<213> Mus musculus 
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<400> 10 

Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly Ser 
15 10 15 

Pro Pro Leu Asp Asp Asn Lys lie Pro Ser Leu Cys Pro Gly Gin Pro 
20 25 30 

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro Gly 
35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 
50 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Met Gly Ala lie Gly Pro Ala Gly 
85 90 95 

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu 
100 105 110 

Ser Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg Val 
115 120 125 

Leu Leu Asn Glu Gin Gly His Tyr Asp Pro Thr Thr Gly Lys Phe Thr 
130 135 140 

Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 150 155 160 

Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Gin Ser lie Ala 
165 170 175 

Ser Phe Phe Gin Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 
180 185 190 

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val Gin 
195 200 205 

Val Gly Val Gly Asp Tyr lie Gly lie Tyr Ala Ser lie Lys Thr Asp 
210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 



<210> 11 
<211> 228 
<212> PRT 

<213> Mus musculus 
<400> 11 

Ser Pro Pro Leu Asp Asp Asn Lys lie Pro Ser Leu Cys Pro Gly Gin 
15 10 15 
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Pro Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro 
20 25 30 

Gly Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly 
35 40 45 

Glu Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu 
50 55 60 

Pro Gly Pro Arg Gly Glu Ala Gly Pro Met Gly Ala lie Gly Pro Ala 
65 70 75 80 

Gly Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser 
85 90 95 

Glu Ser Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg 
100 105 110 

Val Leu Leu Asn Glu Gin Gly His Tyr Asp Pro Thr Thr Gly Lys Phe 
115 120 125 

Thr Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val 
130 135 140 

Tyr Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Gin Ser lie 
145 150 155 160 

Ala Ser Phe Phe Gin Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu 
165 170 175 

Ser Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val 
180 185 190 

Gin Val Gly Val Gly Asp Tyr He Gly He Tyr Ala Ser He Lys Thr 
195 200 205 

Asp Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser 
210 215 220 

Pro Val Phe Ala 
225 



<210> 12 

<211> 15 

<212> PRT 

<213> Mus musculus 

<400> 12 

Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly 
15 10 15 



<210> 13 

<211> 60 

<212> PRT 

<213> Mus musculus 
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<400> 13 
Gly Thr Pro Gly 
1 

Arg Asp Gly Arg 
20 

Gly Gly Arg Pro 
35 

Gly Glu Ala Gly 
50 



His His Gly Ser 
5 

Asp Gly Ala Pro 

Gly Leu Pro Gly 
40 

Pro Met Gly Ala 
55 



Gin Gly Leu Pro 
10 

Gly Ala Pro Gly 
25 

Pro Arg Gly Glu 



He Gly Pro Ala 
60 



Gly Arg Asp Gly 
15 

Glu Lys Gly Glu 
30 

Pro Gly Pro Arg 
45 



<210> 14 

<211> 128 

<212> PRT 

<213> Mus musculus 

<400> 14 

Ala Phe Ser Ala Lys Arg Ser Glu Ser Arg Val Pro Pro Pro Ala Asp 
1 5 10 15 

Thr Pro Leu Pro Phe Asp Arg Val Leu Leu Asn Glu Gin Gly His Tyr 
20 25 30 

Asp Pro Thr Thr Gly Lys Phe Thr Cys Gin Val Pro Gly Val Tyr Tyr 
35 40 45 

Phe Ala Val His Ala Thr Val Tyr Arg Ala Ser Leu Gin Phe Asp Leu 
50 55 60 

Val Lys Asn Gly Gin Ser He Ala Ser Phe Phe Gin Tyr Phe Gly Gly 
65 70 75 80 

Trp Pro Lys Pro Ala Ser Leu Ser Gly Gly Ala Met Val Arg Leu Glu 
85 90 95 

Pro Glu Asp Gin Val Trp Val Gin Val Gly Val Gly Asp Tyr He Gly 
100 105 110 

He Tyr Ala Ser He Lys Thr Asp Ser Thr Phe Ser Gly Phe Leu Val 
115 120 125 



<210> 15 

<211> 1831 

<212> DNA 

<213> Homo sapiens 

<400> 15 

gtcgacccac gcgtccgcgg acgcgtgggt 

tactctggca ccactctcca ggctgccatg 

cttttgtcat ggtcgggacc cctccaagga 

cgccgactag ctgctttaga ggaacggctg 

gctgctgagc tgcgggactt caagaacaag 

gagcgggagg cactcagaac tgaggccgac 

cgggaggtag actatctgga gacccagaac 



gaggggaaga ggctgactgt acgttccttc 60 
gggcccagca cccctctcct catcttgttc 120 
cagcagcacc accttgtgga gtacatggaa 180 
gcccagtgcc aggaccagag tagtcggcat 240 
atgctgccac tgctggaggt ggcagagaag 3 00 
accatctccg ggagagtgga tcgtctggag 360 
ccagctctgc cctgtgtaga gtttgatgag 420 
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aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatatg 4 80 

gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 

ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 

ttagatggga cacagaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctc 660 

gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 

ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 840 

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 102 0 

tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 

agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 162 0 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 16 
<211> 1218 
<212> DNA 

<213> Homo sapiens 
<400> 16 

atggggccca gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 12 0 

ctggcccagt gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 180 

aagatgctgc cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 24 0 

gacaccatct ccgggagagt ggatcgtctg gagcgggagg tagactatct ggagacccag 3 00 

aacccagctc tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 3 60 

ggcaagggaa gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 420 

caagtgagat caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 480 

gatccactgg ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagcc 54 0 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

gtccgggtgc ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 660 

tttgctcgga ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 720 

ctaatcaaat tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 780 

gggctgatcc ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 840 

gaaggtcttt gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 900 

ttagatccac agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 960 

gctgaggctg cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 1020 

agtcgggccc gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 1080 

gcactccctt attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 1140 

gaacgccagc tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 1200 

aagaaagagg aggaggtt 1218 

<210> 17 
<211> 406 
<212> PRT 

<213> Homo sapiens 
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<400> 17 

Met Gly Pro Ser Thr Pro Leu Leu lie Leu Phe Leu Leu Ser Trp Ser 
1 5 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 
20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 
35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 
50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr lie Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 
85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 
100 105 110 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 
115 120 125 

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr lie Ser Gin Val Arg Ser 
130 135 140 

Met Lys lie Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin 
165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 
180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val 
195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg 
210 215 220 

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin 
225 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 
245 250 255 

Phe Pro Ala Glu Gly Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 
260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 
275 280 285 



Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 
290 295 300 
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Thr Leu Asp Thr Glu Gin Gin Trp 
305 310 

Ala Glu Ala Ala Phe Val He Cys 
325 

Thr Arg Pro Ala Ser Arg Ala Arg 
340 

Gly Thr Leu Thr Pro Glu Arg Ala 
355 360 

Tyr Gly Ala His Ala Ser Leu Arg 
370 375 

Tyr Ala Trp Asp Asp Gly Tyr Gin 
385 390 

Lys Lys Glu Glu Glu Val 
405 



Asp Thr Pro Cys Pro Arg Glu Asn 
315 320 

Gly Thr Leu Tyr Val Val Tyr Asn 
330 335 

He Gin Cys Ser Phe Asp Ala Ser 
345 350 

Ala Leu Pro Tyr Phe Pro Arg Arg 
365 

Tyr Asn Pro Arg Glu Arg Gin Leu 
380 

He Val Tyr Lys Leu Glu Met Arg 
395 400 



<210> 18 
<211> 385 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Gin Gin His His Leu Val Glu Tyr Met Glu Arg Arg Leu Ala Ala Leu 
15 10 15 

Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser Ser Arg His Ala Ala 
20 25 30 

Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro Leu Leu Glu Val Ala 
35 40 45 

Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala Asp Thr He Ser Gly 
50 55 60 

Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr Leu Glu Thr Gin Asn 
65 70 75 80 

Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys Val Thr Gly Gly Pro 
85 90 95 

Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr 
100 105 110 

Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser Met Lys He Leu Lys 
115 120 125 

Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys Asp Pro Leu Gly Gin 
130 135 140 

Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin Asn Asp Thr Ala Phe 
145 150 155 160 
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Val Phe Pro Arg Leu 
165 

Ala Ser Arg Val Arg 
180 

Val Tyr Gly Gly Phe 
195 

Gly Gly Gly Gly Glu 
210 

Leu Ala Asn Arg Thr 
225 

Leu lie Pro Pro Tyr 
245 

Ala Asp Glu Glu Gly 
260 

Arg His Leu Cys Leu 
275 

Gin Gin Trp Asp Thr 
290 

Val lie Cys Gly Thr 
305 

Arg Ala Arg lie Gin 
325 

Glu Arg Ala Ala Leu 
340 

Ser Leu Arg Tyr Asn 
355 

Gly Tyr Gin He Val 
370 

Val 
385 



Arg Asp Phe Thr Leu Ala 
170 

Val Pro Phe Pro Trp Val 
185 

Leu Tyr Phe Ala Arg Arg 
200 

Met Glu Asn Thr Leu Gin 
215 

Val Val Asp Ser Ser Val 
230 235 

Gly Leu Thr Ala Asp Thr 
250 

Leu Trp Ala Val Tyr Ala 
265 

Ala Lys Leu Asp Pro Gin 
280 

Pro Cys Pro Arg Glu Asn 
295 

Leu Tyr Val Val Tyr Asn 
310 315 

Cys Ser Phe Asp Ala Ser 
330 

Pro Tyr Phe Pro Arg Arg 
345 

Pro Arg Glu Arg Gin Leu 
360 

Tyr Lys Leu Glu Met Arg 
375 



Met Ala Ala Arg Lys 
175 

Gly Thr Gly Gin Leu 
190 

Pro Pro Gly Arg Pro 
205 

Leu lie Lys Phe His 
220 

Phe Pro Ala Glu Gly 
240 

Tyr He Asp Leu Ala 
255 

Thr Arg Glu Asp Asp 
270 

Thr Leu Asp Thr Glu 
285 

Ala Glu Ala Ala Phe 
300 

Thr Arg Pro Ala Ser 
320 

Gly Thr Leu Thr Pro 
335 

Tyr Gly Ala His Ala 
350 

Tyr Ala Trp Asp Asp 
365 

Lys Lys Glu Glu Glu 
380 



<210> 19 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Met Gly Pro Ser Thr Pro Leu Leu He Leu Phe Leu Leu Ser Trp Ser 
1 5 10 15 

Gly Pro Leu Gin Gly 
20 
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<210> 20 

<211> 244 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Met Leu Leu Leu Gly Ala Val Leu Leu Leu Leu Ala Leu Pro Gly His 
15 10 15 

Asp Gin Glu Thr Thr Thr Gin Gly Pro Gly Val Leu Leu Pro Leu Pro 
20 25 30 

Lys Gly Ala Cys Thr Gly Trp Met Ala Gly He Pro Gly His Pro Gly 
35 40 45 

His Asn Gly Ala Pro Gly Arg Asp Gly Arg Asp Gly Thr Pro Gly Glu 
50 55 60 

Lys Gly Glu Lys Gly Asp Pro Gly Leu He Gly Pro Lys Gly Asp He 
65 70 75 80 

Gly Glu Thr Gly Val Pro Gly Ala Glu Gly Pro Arg Gly Phe Pro Gly 
85 90 95 

He Gin Gly Arg Lys Gly Glu Pro Gly Glu Gly Ala Tyr Val Tyr Arg 
100 105 110 

Ser Ala Phe Ser Val Gly Leu Glu Thr Tyr Val Thr lie Pro Asn Met 
115 120 125 

Pro He Arg Phe Thr Lys He Phe Tyr Asn Gin Gin Asn His Tyr Asp 
130 135 140 

Gly Ser Thr Gly Lys Phe His Cys Asn lie Pro Gly Leu Tyr Tyr Phe 
145 150 155 160 

Ala Tyr His lie Thr Val Tyr Met Lys Asp Val Lys Val Ser Leu Phe 
165 170 175 

Lys Lys Asp Lys Ala Met Leu Phe Thr Tyr Asp Gin Tyr Gin Glu Asn 
180 185 190 

Asn Val Asp Gin Ala Ser Gly Ser Val Leu Leu His Leu Glu Val Gly 
195 200 205 

Asp Gin Val Trp Leu Gin Val Tyr Gly Glu Gly Glu Arg Asn Gly Leu 
210 215 220 

Tyr Ala Asp Asn Asp Asn Asp Ser Thr Phe Thr Gly Phe Leu Leu Tyr 
225 230 235 240 

His Asp Thr Asn 



<210> 21 
<211> 1721 
<212> DNA 
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<213> Mus musculus 



<400> 21 

gtcgacccac gcgtccgact taaggctgcc 
ttctttttgt catggacggg accccttcag 
gaacgccgac cagctgcctt agaggaacgg 
catgctgccg agcttcggga cttcaaaaac 
aaggagcggg agaccctcag aactgaagca 
gaaagggagg tagactatct ggagacacag 
gagaaggtga ctggaggtcc tggagccaaa 
atggtgacgg actgtagcta cacagtcgct 
tttggtggtt cagttggcct atggaccaag 
gtgttagacg gcacccagaa cgacacggct 
cttgccatgg ctgcccggaa agcttcccga 
gggcagctgg tgtacggtgg cttcctttat 
gggggtggtg aattggagaa cactctgcag 
gtggtggata gctcagtgtt ccctgcagag 
gatacatata tcgacctggc agctgatgag 
gatgatgaca ggcatttgtg tctagccaag 
cagtgggaca caccatgtcc cagagagaac 
ctgtacgttg tctataacac ccgccctgcc 
gccagtggta ctctcgcccc tgaaagggca 
gcccatgcca gccttcgcta taacccccgt 
taccagattg tctacaaatt ggagatgaag 
cttgtgctct tgattcttat gcccagacat 
tccttcaaaa cgaaggccag tggtggtagc 
aattctcaag cccctctgtt ttatgcagaa 
aacagcaaac aaacacccta aatcttcact 
aactcagagc cctgtccttt ggagagggtc 
ccctcagtat gaccgaaggg agagaactca 
tccagtgtag gggagaatgg ggctttcccc 
aaaaggaaaa cccaccaaaa aaaaaaaaaa 



atggggccca gtgctcctct gctgctcctc 60 
ggacagcagc accaccttgt ggagtacatg 120 
ctggcccaat gccaggatca gagtagtcgg 180 
aagatgttgc ctctcctgga ggtggcagag 24 0 
gactccatct caggaagagt ggaccgtctt 300 
aacccagctt tgccctgtgt agagctggat 360 
ggcaagggcc gaagaaatga gaaatacgat 42 0 
caggtgaggt caatgaagat cctgaagcgg 4 80 
gatccgctgg ggccagcaga gaagatctac 540 
tttgtcttcc caaggctgcg tgacttcacc 600 
attcgggtgc ccttcccctg ggtaggcacg 660 
tatgctcgaa ggcctcctgg aggacctgga 720 
ctgatcaaat ttcacttggc aaaccgaaca 780 
agcctgatac ccccctacgg cctgacagca 840 
gagggcctgt gggctgtcta tgccactcga 900 
ttagacccac agacacttga cacagagcag 960 
gcagaggctg cgtttgtcat ctgtgggacc 1020 
agtagggctc gtattcagtg ttccttcgat 1080 
gcactctcct attttccacg ccgatatggt 1140 
gagcgccagc tgtatgcctg ggatgatggc 1200 
aagaaggagg aggaagttta agcagctagc 1260 
ttatattcct gtgagctctc ctgcagttca 1320 
tcatataccc taatttctaa aggacaacca 1380 
ctccagatcc tgggtagcat tttagaactg 1440 
cctgccttat gtccacaaag tttagttcca 1500 
aaccccagac agcaggcgac agcattcttg 1560 
gagacaaagc tgccctccct cccttccccc 1620 
acatcacttt gtatggtaac agtttgcatt 1680 
agggcggccg c 1721 



<210> 22 

<211> 1218 

<212> DNA 

<213> Mus musculus 



<400> 22 

atggggccca gtgctcctct gctgctcctc 
ggacagcagc accaccttgt ggagtacatg 
ctggcccaat gccaggatca gagtagtcgg 
aagatgttgc ctctcctgga ggtggcagag 
gactccatct caggaagagt ggaccgtctt 
aacccagctt tgccctgtgt agagctggat 
ggcaagggcc gaagaaatga gaaatacgat 
caggtgaggt caatgaagat cctgaagcgg 
gatccgctgg ggccagcaga gaagatctac 
tttgtcttcc caaggctgcg tgacttcacc 
attcgggtgc ccttcccctg ggtaggcacg 
tatgctcgaa ggcctcctgg aggacctgga 
ctgatcaaat ttcacttggc aaaccgaaca 
agcctgatac ccccctacgg cctgacagca 
gagggcctgt gggctgtcta tgccactcga 
ttagacccac agacacttga cacagagcag 
gcagaggctg cgtttgtcat ctgtgggacc 
agtagggctc gtattcagtg ttccttcgat 
gcactctcct attttccacg ccgatatggt 



ttctttttgt catggacggg accccttcag 60 
gaacgccgac tagctgcctt agaggaacgg 120 
catgctgccg agcttcggga cttcaaaaac 180 
aaggagcggg agaccctcag aactgaagca 240 
gaaagggagg tagactatct ggagacacag 3 00 
gagaaggtga ctggaggtcc tggagccaaa 360 
atggtgacgg actgtagcta cacagtcgct 420 
tttggtggtt cagttggcct atggaccaag 4 80 
gtgttagacg gcacccagaa cgacacggct 540 
cttgccatgg ctgcccggaa agcttcccga 600 
gggcagctgg tgtacggtgg cttcctttat 660 
gggggtggtg aattggagaa cactctgcag 720 
gtggtggata gctcagtgtt ccctgcagag 780 
gatacatata tcgacctggc agctgatgag 840 
gatgatgaca ggcatttgtg tctagccaag 900 
cagtgggaca caccatgtcc cagagagaac 960 
ctgtacgttg tctataacac ccgccctgcc 1020 
gccagtggta ctctcgcccc tgaaagggca 1080 
gcccatgcca gccttcgcta taacccccgt 1140 
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gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 
aagaaggagg aggaagtt 1218 

<210> 23 
<211> 406 
<212> PRT 

<213> Mus musculus 
<400> 23 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 
15 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 
20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 
35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 
50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser lie Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 
85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 
100 105 110 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 
115 120 125 

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser 
130 135 140 

Met Lys lie Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Pro Ala Glu Lys He Tyr Val Leu Asp Gly Thr Gin 
165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 
180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg He Arg Val Pro Phe Pro Trp Val 
195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 
210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin 
225 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 
245 250 255 

Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 
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260 265 270 

Tyr lie Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 
275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 
290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 
325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 
340 345 350 

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg 
355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 
370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Lys 
385 390 395 400 

Lys Lys Glu Glu Glu Val 
405 



<210> 24 
<211> 385 
<212> PRT 

<213> Mus musculus 
<400> 24 

Gin Gin His His Leu Val Glu Tyr Met Glu Arg Arg Leu Ala Ala Leu 
15 10 15 

Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser Ser Arg His Ala Ala 
20 25 30 

Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro Leu Leu Glu Val Ala 
35 40 45 

Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala Asp Ser He Ser Gly 
50 55 60 

Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr Leu Glu Thr Gin Asn 
65 70 75 80 

Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys Val Thr Gly Gly Pro 
85 90 95 

Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr 
100 105 110 

Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser Met Lys He Leu Lys 
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115 120 125 

Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys Asp Pro Leu Gly Pro 
130 135 140 

Ala Glu Lys He Tyr Val Leu Asp Gly Thr Gin Asn Asp Thr Ala Phe 
145 150 155 160 

Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala Met Ala Ala Arg Lys 
165 170 175 

Ala Ser Arg He Arg Val Pro Phe Pro Trp Val Gly Thr Gly Gin Leu 
180 185 190 

Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg Pro Pro Gly Gly Pro 
195 200 205 

Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin Leu He Lys Phe His 
210 215 220 

Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val Phe Pro Ala Glu Ser 
225 230 235 240 

Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr Tyr He Asp Leu Ala 
245 250 255 

Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala Thr Arg Asp Asp Asp 
260 265 270 

Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin Thr Leu Asp Thr Glu 
275 280 285 

Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn Ala Glu Ala Ala Phe 
290 295 300 

Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn Thr Arg Pro Ala Ser 
305 310 315 320 

Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser Gly Thr Leu Ala Pro 
325 330 335 

Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg Tyr Gly Ala His Ala 
340 345 350 

Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu Tyr Ala Trp Asp Asp 
355 360 365 

Gly Tyr Gin He Val Tyr Lys Leu Glu Met Lys Lys Lys Glu Glu Glu 
370 375 380 

Val 
385 



<210> 25 
<211> 21 
<212> PRT 

<213> Mus musculus 
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<400> 25 

Met Gly Pro Ser Ala Pro Leu Leu 
1 5 

Gly Pro Leu Gin Gly 
20 



Leu Leu Phe Phe Leu Ser Trp Thr 
10 15 



<210> 26 

<211> 1869 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 
<222> all M n" positions 
<223> n=a, c, g, or t 

<400> 26 

gtcgacccac gcgtncntcc agcgtncgga 
gcgcacgctc cggccgtcgc gcagcctcgg 
cgcccctgac tccgtcccgg ccagggaggg 
ccaacttgnt gcggtttttg ttcctggggc 
agctgcaact gcacttgccc gccaaccggt 
cttcagcatg gtacaccttg cacagggagg 
ttgtgatgtg gttcttcaaa cagaaagaaa 
gggtcacaac aagcaaacct ggagtatcct 
ccctgcgggt ggagggtctc caggagaaag 
tgcaagacaa acaaggcaaa tctaggggcc 
tggttcctcc agctcctcca tcctgccgtc 
tgaccctgag ctgccagtct ccaaggagta 
agcttccatc cttccagact ttctttgcac 
gcctcaccaa cctttcgtct tccatggctg 
tgggcactgc ccaatgtaat gtgacgctgg 
ttgctgaagc tgttgtgggt accctggttg 
tgtaccaccg ccggggcaag gccctggagg 
ttgctccccg gaccctgccc tggcccaaga 
tttcctctgt cacctccgca cgagccctcc 
cattgacccc cacgcccagt ctatccagcc 
acagatgggg cccaccctca accaatatcc 
tttgagccgc atgggtgctg ngcctgtgat 
ggtatgatga ccccaccact cattggctaa 
tcacctctag cacagaggcc tgagtcatgg 
ctgcccccac ctctctttac tgtgggaaaa 
acagaaggag aagaggaagt ggatctggaa 
ctccttatga agccagctgc tgaaattagc 
cagtcactga gtctcccagg cccccttgat 
tggctcccac tccagctccc tgtattgata 
actggggcag aggataggga atctcttatt 
tttgcaaatt taaataaaga tacataatgt 
ggcggccgc 

<210> 27 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 



gccgccctgg gtgtcagcgg ctcggctccc 60 
cacctgcagg tccgtgcgtc ccgcggctgg 120 
ccatgatttc cctcccgggg cccctggtga 180 
tgagtgccct cgcgcccccc tcgcgggccc 240 
tgcaggcggt ggaggagggg gaaagtggtg 300 
tgtcttcatc ccagccatgg gaggtgccct 360 
aggaggatca ggtgttgtcc tacatcaatg 420 
tggtctactc catgccctcc cggaacctgt 480 
actctggccc ctacagctgc tccgtgaatg 540 
acagcatcaa aaccttagaa ctcaatgtac 600 
tccagggtgt gccccatgtg ggggcaaacg 660 
agcccgctgt ccaataccag tgggatcggc 720 
cagcattaga tgtcatccgt gggtctttaa 780 
gagtctatgt ctgcaaggcc cacaatgagg 840 
aagtgagcac agggcctgga gctgcagtgg 900 
gactggggtt gctggctggg ctggtcctct 960 
agccagccaa tgatatcaag gaggatgcca 1020 
gctcagacac aatctccaag aatgggaccc 1080 
ggccacccca tggccctccc aggcctggtg 1140 
aggccctgcc ctcaccaaga catgcccacg 1200 
cccatccctg gtggggtttt ttcctttggc 1260 
ggngcctgcc cagagtcaag ctggctctct 1320 
aggatttggg gtctctcctt cctataaggg 1380 
gaaagagtca cactcctgac ccttagtact 1440 
ccatctcagt aagacctaag tgtccaggag 1500 
ttgggaggag cctccaccca cccctgactc 1560 
tactcaccaa gagtgagggg cagagacttc 1620 
ctgtacccca cccctatcta acaccaccct 1680 
taacctgtca ggctggcttg gttaggtttt 1740 
aaaactaaca tgaaatatgt gttgttttca 1800 
ttgtatgaga taagaaaaaa aaaaaaaaag 1860 

1869 
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<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 



<400> 27 

atgatttccc tcccggggcc cctggtgacc 
agtgccctcg cgcccccctc gcgggcccag 
caggcggtgg aggaggggga aagtggtgct 
tcttcatccc agccatggga ggtgcccttt 
gaggatcagg tgttgtccta catcaatggg 
gtctactcca tgccctcccg gaacctgtcc 
tctggcccct acagctgctc cgtgaatgtg 
agcatcaaaa ccttagaact caatgtactg 
cagggtgtgc cccatgtggg ggcaaacgtg 
cccgctgtcc aataccagtg ggatcggcag 
gcattagatg tcatccgtgg gtctttaagc 
gtctatgtct gcaaggccca caatgaggtg 
gtgagcacag ggcctggagc tgcagtggtt 
ctggggttgc tggctgggct ggtcctcttg 
ccagccaatg atatcaagga ggatgccatt 
tcagacacaa tctccaagaa tgggaccctt 
ccaccccatg gccctcccag gcctggtgca 
gccctgccct caccaagaca tgcccacgac 
catccctggt ggggtttttt cctttggctt 



aacttgntgc ggtttttgtt cctggggctg 60 
ctgcaactgc acttgcccgc caaccggttg 120 
tcagcatggt acaccttgca cagggaggtg 180 
gtgatgtggt tcttcaaaca gaaagaaaag 240 
gtcacaacaa gcaaacctgg agtatccttg 300 
ctgcgggtgg agggtctcca ggagaaagac 360 
caagacaaac aaggcaaatc taggggccac 420 
gttcctccag ctcctccatc ctgccgtctc 480 
accctgagct gccagtctcc aaggagtaag 54 0 
cttccatcct tccagacttt ctttgcacca 600 
ctcaccaacc tttcgtcttc catggctgga 660 
ggcactgccc aatgtaatgt gacgctggaa 72 0 
gctgaagctg ttgtgggtac cctggttgga 780 
taccaccgcc ggggcaaggc cctggaggag 84 0 
gctccccgga ccctgccctg gcccaagagc 900 
tcctctgtca cctccgcacg agccctccgg 960 
ttgaccccca cgcccagtct atccagccag 102 0 
agatggggcc caccctcaac caatatcccc 1080 

1110 



<210> 28 
<211> 370 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (13) 

<22 3> Xaa=unknown amino acid 
<400> 28 

Met lie Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu 
1 5 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gin Leu Gin 
20 25 30 

Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu Gly Glu Ser 
35 40 45 

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gin 
50 55 60 

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin Lys Glu Lys 
65 70 75 80 

Glu Asp Gin Val Leu Ser Tyr lie Asn Gly Val Thr Thr Ser Lys Pro 
85 90 95 

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg 
100 105 HO 

Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val 
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115 120 125 

Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser He Lys Thr 
130 135 140 

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu 
145 150 155 160 

Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gin Ser 
165 170 175 

Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro 
180 185 190 

Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He Arg Gly Ser 
195 200 205 

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys 
210 215 220 

Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val Thr Leu Glu 
225 230 235 240 

Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala Val Val Gly 
245 250 255 

Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu Leu Tyr His 
260 265 270 

Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp He Lys Glu Asp 
275 280 285 

Ala He Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr He 
290 295 300 

Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu Arg 
305 310 315 320 

Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser 
325 330 335 

Leu Ser Ser Gin Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp 
340 345 350 

Gly Pro Pro Ser Thr Asn He Pro His Pro Trp Trp Gly Phe Phe Leu 
355 360 365 

Trp Leu 
370 



<210> 29 
<211> 341 
<212> PRT 

<213> Mus musculus 
<400> 29 

Gin Leu Gin Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu 
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15 10 15 

Gly Glu Ser Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser 
20 25 30 

Ser Ser Gin Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin 
35 40 45 

Lys Glu Lys Glu Asp Gin Val Leu Ser Tyr He Asn Gly Val Thr Thr 
50 55 60 

Ser Lys Pro Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu 
65 70 75 80 

Ser Leu Arg Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser 
85 90 95 

Cys Ser Val Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser 
100 105 HO 

He Lys Thr Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser 
115 120 125 

Cys Arg Leu Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser 
130 135 140 

Cys Gin Ser Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg 
145 150 155 160 

Gin Leu Pro Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He 
165 170 175 

Arg Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val 
180 185 190 

Tyr Val Cys Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val 
195 200 205 

Thr Leu Glu Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala 
210 215 220 

Val Val Gly Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu 
225 230 235 240 

Leu Tyr His Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp He 
245 250 255 

Lys Glu Asp Ala He Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser 
260 265 270 

Asp Thr He Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg 
275 280 285 

Ala Leu Arg Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro 
290 295 300 

Thr Pro Ser Leu Ser Ser Gin Ala Leu Pro Ser Pro Arg His Ala His 
305 310 315 320 
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Asp Arg Trp Gly Pro Pro Ser Thr Asn He Pro His Pro Trp Trp Gly 
325 330 335 

Phe Phe Leu Trp Leu 
340 



<210> 


30 


<211> 


29 


<212> 


PRT 


<213> 


Mus i 


<220> 




<221> 


SITE 


<222> 


(13) 


<223> 


Xaa= 


<400> 


30 



Met He Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu 
1 5 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala 
20 25 



<210> 31 

<211> 246 

<212> PRT 

<213> Mus musculus 

<220> 

<221> SITE 
<222> (13) 

<22 3> Xaa=unknown amino acid 
<400> 31 

Met He Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu 
15 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gin Leu Gin 
20 25 30 

Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu Gly Glu Ser 
35 40 45 

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gin 
50 55 60 

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin Lys Glu Lys 
65 70 75 80 

Glu Asp Gin Val Leu Ser Tyr He Asn Gly Val Thr Thr Ser Lys Pro 
85 90 95 

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg 
100 105 110 

Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val 
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115 



120 125 



Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser He Lys Thr 
130 135 140 



Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu 
145 



150 155 160 



Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gin Ser 
165 170 175 

Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro 
180 185 190 

Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He Arg Gly Ser 
195 200 205 

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys 
2io 215 220 

Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val Thr Leu Glu 
225 230 235 240 

Val Ser Thr Gly Pro Gly 
245 



<210> 32 
<211> 653 
<212> DNA 

<213> Homo sapiens 



<400> 32 

ttttttgcat gtaacttttt tattgaggca 
aggcagtcag tttagtaagc tgaacgttaa 
cattcacagc ttgactagcg aggctacatc 
ttgtcattca gcttgatttt tcacctcagg 
ctctaggaac aaaaaacatt ttcctaaacc 
cttcaatccc actaattaga acaccatcct 
cttgataaag aagatagcct gaaaacgtag 
ccctgtgcag acgtaaccat atctggtctc 
tagcagtttc atggtctgga gcaccatcat 
tcttcantga ggttgacata cagtggcaca 
aaaaacgtag tgccattcac ttggcaatta 

<210> 33 

<211> 25 

<212> PRT 

<213> Mus musculus 



caacaaggca ttgtaacttg cctggacttg 60 
tacagttaag gattaagtgc aaacaatata 120 
acaatttata aagtgccaga ttagtgctaa 180 
aaggaaaaca aaaaagtaag gacctcctcc 240 
aatcagtcat gagggcaaag actacttttc 3 00 
tttattgtca atactgtact gactttcaat 360 
aatatttcca gctacttcca taaattgctc 420 
cctggaagag ctgaagaatt gcatgattgc 4 80 
tggcataggc tgataccaag acctcttcat 540 
ttcactgcca gcttttacat gtgaaaaatg 600 
aatctaccaa agctgagatc aaa . 653 



<400> 33 , 
Ala Ala Val Val Ala Glu Ala Val Val Gly Thr Leu Val Gly Leu Gly 
1 5 10 I 5 



Leu Leu Ala Gly Leu Val Leu Leu Tyr 
20 25 
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<210> 34 
<211> 99 
<212> PRT 

<213> Mus musculus 
<400> 34 

His Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp He Lys Glu 
1 5 10 15 

Asp Ala He Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr 
20 25 30 

lie Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu 
35 40 45 

Arg Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro 
50 55 60 

Ser Leu Ser Ser Gin Ala Leu Pro Ser Pro Arg His Ala His Asp Arg 
65 70 75 80 

Trp Gly Pro Pro Ser Thr Asn He Pro His Pro Trp Trp Gly Phe Phe 
85 90 95 

Leu Trp Leu 



<210> 35 
<211> 80 
<212> PRT 

<213> Mus musculus 
<400> 35 

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gin 
15 10 15 

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin Lys Glu Lys 
20 25 30 

Glu Asp Gin Val Leu Ser Tyr He Asn Gly Val Thr Thr Ser Lys Pro 
35 40 45 

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg 
50 55 60 

Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val 
65 * 70 75 80 



<210> 36 

<211> 60 

<212> PRT 

<213> Mus musculus 
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<400> 36 

Gly Ala Asn Val Thr Leu Ser Cys Gin Ser Pro Arg Ser Lys Pro Ala 
15 10 15 

Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro Ser Phe Gin Thr Phe Phe 
20 25 30 

Ala Pro Ala Leu Asp Val lie Arg Gly Ser Leu Ser Leu Thr Asn Leu 
35 40 45 

Ser Ser Ser Met Ala Gly Val Tyr Val Cys Lys Ala 
50 55 60 



<210> 37 
<211> 1846 
<212> DNA 

<213> Mus musculus 
<400> 37 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240 

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300 

ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360 

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480 

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540 

caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840 

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020 

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 

caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140 

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200 

cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320 

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620 

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740 

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 1846 

<210> 38 
<211> 1182 
<212> DNA 

<213> Mus musculus 
<400> 38 
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atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 
agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 
aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 
gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 240 
gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 
acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 
gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 
ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480 
tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 
aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 
tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 
atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 
accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 
tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840 
ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 
accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 
gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 
gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 
gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 1140 
cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 39 
<211> 394 
<212> PRT 

<213> Mus musculus 
<400> 39 

Met lie Leu Gin Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu 
15 10 15 

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gin Met Glu 
20 25 30 

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu 
35 40 45 

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp 
50 55 60 

Ser His Pro Arg Glu Val Pro He Leu He Trp Phe Leu Glu Gin Glu 
65 70 75 80 

Gly Lys Glu Pro Asn Gin Val Leu Ser Tyr He Asn Gly Val Met Thr 
85 90 95 

Asn Lys Pro Gly Thr Ala Leu Val His Ser He Ser Ser Arg Asn Val 
100 105 110 

Ser Leu Arg Leu Gly Ala Leu Gin Glu Gly Asp Ser Gly Thr Tyr Arg 
115 120 125 

Cys Ser Val Asn Val Gin Asn Asp Glu Gly Lys Ser He Gly His Ser 
130 135 140 

He Lys Ser He Glu Leu Lys Val Leu Val Pro Pro Ala Pro Pro Ser 
145 150 155 160 

Cys Ser Leu Gin Gly Val Pro Tyr Val Gly Thr Asn Val Thr Leu Asn 
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165 170 175 

Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gin Tyr Gin Trp Glu Arg 
180 185 190 

Leu Ala Pro Ser Ser Gin Val Phe Phe Gly Pro Ala Leu Asp Ala Val 
195 200 205 

Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser He Ala Met Ser Gly Val 
210 215 220 

Tyr Val Cys Lys Ala Gin Asn Arg Val Gly Phe Ala Lys Cys Asn Val 
225 230 235 240 

Thr Leu Asp Val Met Thr Gly Ser Lys Ala Ala Val Val Ala Gly Ala 
245 250 255 

Val Val Gly Thr Phe Val Gly Leu Val Leu He Ala Gly Leu Val Leu 
260 265 270 

Leu Tyr Gin Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala Asn Asp He 
275 280 285 

Lys Glu Asp Ala He Ala Pro Arg Thr Leu Pro Trp Thr Lys Gly Ser 
290 295 300 

Asp Thr He Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg 
305 310 315 320 

Ala Leu Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly Thr Phe Thr 
325 330 335 

Pro Thr Pro Ser Val Ser Ser Gin Ala Leu Ser Ser Pro Arg Leu Pro 
340 345 350 

Arg Val Asp Glu Pro Pro Pro Gin Ala Val Ser Leu Thr Pro Gly Gly 
355 360 365 

Val Ser Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro Val Met Val 
370 375 380 

Pro Ala Gin Ser Gin Ala Gly Ser Leu Val 
385 390 



<210> 40 

<211> 365 

<212> PRT 

<213> Mus 



musculus 



<400> 40 

Gin Met Glu Leu His Val Pro Pro 
1 5 

Glu Gly Glu Glu Val Val Leu Pro 
20 

Glu Ser Trp Ser His Pro Arg Glu 



Gly Leu Asn Lys Leu Glu Ala Val 
10 15 

Ala Trp Tyr Thr Met Ala Arg Glu 
25 4 * 30 

Val Pro He Leu lie Trp Phe Leu 
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35 40 45 

Glu Gin Glu Gly Lys Glu Pro Asn Gin Val Leu Ser Tyr lie Asn Gly 
50 55 60 

Val Met Thr Asn Lys Pro Gly Thr Ala Leu Val His Ser He Ser Ser 
65 70 75 80 

Arg Asn Val Ser Leu Arg Leu Gly Ala Leu Gin Glu Gly Asp Ser Gly 
85 90 95 

Thr Tyr Arg Cys Ser Val Asn Val Gin Asn Asp Glu Gly Lys Ser He 
100 105 HO 

Gly His Ser He Lys Ser He Glu Leu Lys Val Leu Val Pro Pro Ala 
115 120 125 

Pro Pro Ser Cys Ser Leu Gin Gly Val Pro Tyr Val Gly Thr Asn Val 
130 135 140 

Thr Leu Asn Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gin Tyr Gin 
145 150 155 160 

Trp Glu Arg Leu Ala Pro Ser Ser Gin Val Phe Phe Gly Pro Ala Leu 
165 170 175 

Asp Ala Val Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser He Ala Met 
180 185 190 

Ser Gly Val Tyr Val Cys Lys Ala Gin Asn Arg Val Gly Phe Ala Lys 
195 200 205 

Cys Asn Val Thr Leu Asp Val Met Thr Gly Ser Lys Ala Ala Val Val 
210 215 220 

Ala Gly Ala Val Val Gly Thr Phe Val Gly Leu Val Leu He Ala Gly 
225 230 235 240 

Leu Val Leu Leu Tyr Gin Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala 
245 250 255 

Asn Asp He Lys Glu Asp Ala He Ala Pro Arg Thr Leu Pro Trp Thr 
260 265 270 

Lys Gly Ser Asp Thr He Ser Lys Asn Gly Thr Leu Ser Ser Val Thr 
275 280 285 

Ser Ala Arg Ala Leu Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly 
290 295 300 

Thr Phe Thr Pro Thr Pro Ser Val Ser Ser Gin Ala Leu Ser Ser Pro 
305 310 315 320 

Arg Leu Pro Arg Val Asp Glu Pro Pro Pro Gin Ala Val Ser Leu Thr 
325 330 335 

Pro Gly Gly Val Ser Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro 
340 345 350 



28 



WO 00/78808 



PCT/US00/16883 



Val Met Val Pro Ala Gin Ser Gin Ala Gly Ser Leu Val 
355 360 365 



<210> 41 
<211> 29 
<212> PRT 

<213> Mus musculus 
<400> 41 

Met He Leu Gin Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu 
15 10 15 

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala 
20 25 



<210> 42 
<211> 249 
<212> PRT 

<213> Mus musculus 
<400> 42 

Met He Leu Gin Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu 
15 10 15 

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gin Met Glu 
20 25 30 

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu 
35 40 45 

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp 
50 55 60 

Ser His Pro Arg Glu Val Pro He Leu He Trp Phe Leu Glu Gin Glu 
65 70 75 80 

Gly Lys Glu Pro Asn Gin Val Leu Ser Tyr He Asn Gly Val Met Thr 
85 90 95 

Asn Lys Pro Gly Thr Ala Leu Val His Ser He Ser Ser Arg Asn Val 
100 105 110 

Ser Leu Arg Leu Gly Ala Leu Gin Glu Gly Asp Ser Gly Thr Tyr Arg 
115 120 125 

Cys Ser Val Asn Val Gin Asn Asp Glu Gly Lys Ser He Gly His Ser 
130 135 140 

lie Lys Ser He Glu Leu Lys Val Leu Val Pro Pro Ala Pro Pro Ser 
145 150 155 160 

Cys Ser Leu Gin Gly Val Pro Tyr Val Gly Thr Asn Val Thr Leu Asn 
165 170 175 

Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gin Tyr Gin Trp Glu Arg 
180 185 190 
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Leu Ala Pro Ser Ser Gin Val Phe Phe Gly Pro Ala Leu Asp Ala Val 
195 200 205 

Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser lie Ala Met Ser Gly Val 
210 215 220 

Tyr Val Cys Lys Ala Gin Asn Arg Val Gly Phe Ala Lys Cys Asn Val 
225 230 235 240 

Thr Leu Asp Val Met Thr Gly Ser Lys 
245 



<210> 43 
<211> 355 
<212> PRT 

<213> Mus musculus 
<220> 

<221> SITE 
<222> (355) 

<223> Xaa=unknown amino acid 
<400> 43 

Met Gly Pro Ser Thr Pro Leu Leu lie Leu Phe Leu Leu Ser Trp Ser 
1 5 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 
20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 
35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 
50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 
85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 
100 105 110 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 
115 120 125 

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser 
130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin 
165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 
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180 



185 



190 



Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val 
195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg 
210 215 220 

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin 
225 230 235 240 

Leu lie Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 
245 250 255 

Phe Pro Ala Glu Gly Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 
260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 
275 280 285 

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 
290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 " 310 315 320 

Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 
325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 
340 345 350 



Gly Pro Xaa 
355 



<210> 44 

<211> 25 

<212> PRT 

<213> Mus musculus 

<400> 44 

Ala Ala Val Val Ala Gly Ala Val Val Gly Thr Phe Val Gly Leu Val 
! 5 10 15 

Leu lie Ala Gly Leu Val Leu Leu Tyr 
20 25 



<210> 45 

<211> 120 

<212> PRT 

<213> Mus musculus 

<400> 45 

Gin Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala Asn Asp He Lys Glu 
1 5 10 15 
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Asp Ala He Ala Pro Arg Thr Leu Pro Trp Thr Lys Gly Ser Asp Thr 
20 25 30 

He Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu 
35 40 45 

Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly Thr Phe Thr Pro Thr 
50 55 60 

Pro Ser Val Ser Ser Gin Ala Leu Ser Ser Pro Arg Leu Pro Arg Val 
65 70 75 80 

Asp Glu Pro Pro Pro Gin Ala Val Ser Leu Thr Pro Gly Gly Val Ser 
85 90 95 

Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro Val Met Val Pro Ala 
100 105 HO 



Gin Ser Gin Ala Gly Ser Leu Val 
115 120 



<210> 46 

<211> 1801 

<212> DNA 

<213> Homo sapiens 

<400> 46 

gtcgacccac gcgtccggcg gaggttgtgg 
cttgatgcgt ctgtttgtcc gtccgtccgt 
tccgtggacc ctttcgctgc tgctgttgtt 
gctgtgcagg cccttcggtg aagacaattc 
tggctcctgt tccagccaat actgctgctc 
ggaaatgtgc cctgagccag agtccagcag 
gctgggttca gcgctgaagt atcagtccag 
agcgaccgtg gccatcggcc tgaccgtctt 
ctttacctgc tcctgctgct gtctatataa 
caacaccaca actactaccg tggttcacac 
cagctatcct ggaccaacat accagggcta 
agcagcaccc tacccaacgc agtaccctcc 
agcctatcat gagacgttgg ctggagccag 
tcccccaaag gcagttccct gagcctgccc 
catgtgtgtg tgagtgctat gcagagttct 
ctagacatgt ggcttcctct gctgatgacc 
gggaccaatc tgttttcttc ctcacttgaa 
aaaacaatag ggtaggaggt atttcccgct 
acacatagga gagcaagctt tttgtgggtc 
ctgctgctat gggtttattc ccagggcttg 
cctcagtggc agtgggtcag tgactgatgt 
tctccatcag ctgtctgtct ggacggtccc 
acatgtattc actattcagg ctccagtggc 
actctggttt ctccctacag tgtcttttta 
tatccagatg tgtgataatt ggtgaggttg 
tgacctctga cagtccgttt cccttgacac 
acagtccagt ttgttccagt agcagggaca 
gggggtggag agcctggatg gtagctctgg 
tggttcctgt ttcactggct gttttagttt 
ctccgcaccc tcgtttataa taaatgaata 



ctgcaccgtg gtcctgggct tggtcctggg 60 
ccgtcccgcc atggctgcgc cggcgccctc 120 
gctactgccg tctccgggtg cccatggcga 180 
gatcccagag tcctgtcctg acttctgttg 240 
tgacgtgctg aagaaaatcc agtggaatga 300 
attttccgcc cacccggaga caccagaaca 360 
tcttgacagt gacaacatgc cagggttcgg 420 
cgtggtgttt atcgctacca tcattgtgtg 4 80 
gatgtgctgc cgcccacgac ctgtcgtgtc 540 
cgcttaccct cagcctcaac ctgtggcccc 600 
ccatcccatg cccccccagc caggaatgcc 660 
accctacctg gcccagccca cagggccacc 720 
ccagcctcca tacaacccgg cctacatgga 780 
ccagcctctt tggctaacat ttgattatgt 840 
ttactgctgt ctgtggtgcg tgtgccttgt 900 
aggtaggcac aaatcttacc agtgctggtt 960 
attgtaattt ctgaaatttc aagtaaatta 1020 
tcaccccaag gtgaccagcc atagcctgcc 1080 
catgtcctgc tttggggagt agccagctag 114 0 
gctgcattta gctggacaga gaacaagggg 1200 
cagagcacac taggcagaga gccccgtccg 1260 
actgtctttc ctgggactat gtagagggcc 1320 
ttccaggcca ggggcctctg tctactacac 1380 
cgattagcca aacatattgc ctgttttttg 1440 
aaatccttgg ttcctggaga acaggaaacc 1500 
cagcttcata gaatacctga ctcctgtact 1560 
ccagggccag gggttatctg gaccaagggt 1620 
accagatgtg aatgcctcca tattccctgt 1680 
tgtgttaatt ggtgtttctg agcattcaaa 1740 
tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 
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a 18 °1 

<210> 47 
<211> 735 
<212> DNA 

<213> Homo sapiens 
<400> 47 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 
tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120 
tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 
tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240 
atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300 
ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 
accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 
acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 
accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 
tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 
gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 
tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720 
ccaaaggcag ttccc 735 

<210> 48 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<400> 48 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
15 10 15 

Ala Pro Ser Pro Trp Thr Leu Ser Leu Leu Leu Leu Leu Leu Leu Pro 
20 25 30 

Ser Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn 
35 40 45 

Ser He Pro Glu Ser Cys Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser 
50 55 60 

Gin Tyr Cys Cys Ser Asp Val Leu Lys Lys He Gin Trp Asn Glu Glu 
65 70 75 80 

Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Ala His Pro Glu Thr 
85 90 95 

Pro Glu Gin Leu Gly Ser Ala Leu Lys Tyr Gin Ser Ser Leu Asp Ser 
100 105 HO 

Asp Asn Met Pro Gly Phe Gly Ala Thr Val Ala He Gly Leu Thr Val 
115 120 125 

Phe Val Val Phe He Ala Thr He He Val Cys Phe Thr Cys Ser Cys 
130 135 140 

Cys Cys Leu Tyr Lys Met Cys Cys Arg Pro Arg Pro Val Val Ser Asn 
145 150 155 160 



33 



WO 00/78808 



PCT/USOO/16883 



Thr Thr Thr Thr Thr Val Val His 
165 

Val Ala Pro Ser Tyr Pro Gly Pro 
180 

Pro Pro Gin Pro Gly Met Pro Ala 
195 200 

Pro Pro Tyr Leu Ala Gin Pro Thr 
210 " 215 

Leu Ala Gly Ala Ser Gin Pro Pro 
225 230 

Pro Lys Ala Val Pro 
245 



Thr Ala Tyr Pro Gin Pro Gin Pro 
170 175 

Thr Tyr Gin Gly Tyr His Pro Met 
185 190 

Ala Pro Tyr Pro Thr Gin Tyr Pro 
205 

Gly Pro Pro Ala Tyr His Glu Thr 
220 

Tyr Asn Pro Ala Tyr Met Asp Pro 
235 240 



<210> 49 

<211> 38 

<212> PRT 

<213> Homo sapiens 

<400> 49 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
1 ~ 5 10 15 

Ala Pro Ser Pro Trp Thr Leu Ser Leu Leu Leu Leu Leu Leu Leu Pro 
20 25 30 

Ser Pro Gly Ala His Gly 
35 



<210> 50 

<211> 207 

<212> PRT 

<213> Homo sapiens 

<400> 50 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser lie Pro Glu Ser Cys 
15 10 15 

Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser Gin Tyr Cys Cys Ser Asp 
20 25 30 

Val Leu Lys Lys He Gin Trp Asn Glu Glu Met Cys Pro Glu Pro Glu 
35 40 45 

Ser Ser Arg Phe Ser Ala His Pro Glu Thr Pro Glu Gin Leu Gly Ser 
50 55 60 

Ala Leu Lys Tyr Gin Ser Ser Leu Asp Ser Asp Asn Met Pro Gly Phe 
65 70 75 80 

Gly Ala Thr Val Ala He Gly Leu Thr Val Phe Val Val Phe He Ala 
85 90 95 
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Thr He He Val Cys Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met 
100 105 HO 

Cys Cys Arg Pro Arg Pro Val Val Ser Asn Thr Thr Thr Thr Thr Val 
115 120 125 

Val His Thr Ala Tyr Pro Gin Pro Gin Pro Val Ala Pro Ser Tyr Pro 
130 135 140 

Gly Pro Thr Tyr Gin Gly Tyr His Pro Met Pro Pro Gin Pro Gly Met 
145 150 155 160 

Pro Ala Ala Pro Tyr Pro Thr Gin Tyr Pro Pro Pro Tyr Leu Ala Gin 
165 170 175 

Pro Thr Gly Pro Pro Ala Tyr His Glu Thr Leu Ala Gly Ala Ser Gin 
180 185 190 

Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro Pro Lys Ala Val Pro 
195 200 205 



<210> 51 
<211> 85 
<212> PRT 

<213> Homo sapiens 
<400> 51 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser He Pro Glu Ser Cys 
15 10 15 

Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser Gin Tyr Cys Cys Ser Asp 
20 25 30 

Val Leu Lys Lys He Gin Trp Asn Glu Glu Met Cys Pro Glu Pro Glu 
35 40 45 

Ser Ser Arg Phe Ser Ala His Pro Glu Thr Pro Glu Gin Leu Gly Ser 
50 55 60 

Ala Leu Lys Tyr Gin Ser Ser Leu Asp Ser Asp Asn Met Pro Gly Phe 
65 70 75 80 

Gly Ala Thr Val Ala 
85 



<210> 52 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 52 

He Gly Leu Thr Val Phe Val Val Phe He Ala Thr He He Val Cys 
1 5 10 15 

Phe Thr Cys Ser Cys Cys Cys Leu Tyr 
20 25 
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<210> 53 
<211> 97 
<212> PRT 

<213> Homo sapiens 

<400> 53 ^ _ 

Lys Met Cys Cys Arg Pro Arg Pro Val Val Ser Asn Thr Thr Thr Thr 
1 5 10 15 

Thr Val Val His Thr Ala Tyr Pro Gin Pro Gin Pro Val Ala Pro Ser 
20 25 30 

Tyr Pro Gly Pro Thr Tyr Gin Gly Tyr His Pro Met Pro Pro Gin Pro 
35 40 45 

Gly Met Pro Ala Ala Pro Tyr Pro Thr Gin Tyr Pro Pro Pro Tyr Leu 
50 55 60 

Ala Gin Pro Thr Gly Pro Pro Ala Tyr His Glu Thr Leu Ala Gly Ala 
65 70 75 80 

Ser Gin Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro Pro Lys Ala Val 
85 90 95 

Pro 



<210> 54 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 54 

Cys Arg Pro Phe Gly Glu Asp Asn Ser He Pro Glu Ser Cys Pro Asp 
1 5 10 15 

Phe Cys Cys Gly Ser Cys Ser Ser Gin Tyr Cys Cys Ser Asp Val Leu 
20 25 30 

Lys Lys He Gin Trp Asn Glu Glu Met Cys Pro Glu Pro Glu Ser Ser 
35 40 45 

Arg Phe 
50 



<210> 55 
<211> 56 
<212> PRT 

<213> Homo sapiens 

<400> 55 ^ _ _ 

Thr Val Phe Val Val Phe He Ala Thr He He Val Cys Phe Thr Cys 
1 5 10 15 

Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Arg Pro Arg Pro Val Val 
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20 



25 



30 



Ser Asn Thr Thr Thr Thr Thr Val Val His Thr Ala Tyr Pro Gin Pro 
35 40 45 



Gin Pro Val Ala Pro Ser Tyr Pro 
50 55 



<210> 56 

<211> 1858 

<212> DNA 

<213> Mus musculus 



<400> 56 

gtcgacccac gcgtccgcgc ggaggttgcg 
tgttcgtccg tccgttggtc tgtcccgcca 
tattgctgct gctgttgctg ctgccgccgc 
cctttggtga agacaattcg atcccagtgt 
ccaaccaata ctgctgctcg gacgtgctga 
ctgagccaga gtccagcaga ttttccaccc 
cagcgctgaa atttcgatcc agttttgaca 
tcgccattgg cgtgaccatc tttgtggtgt 
gctcctgctg ctgtctgtat aagatgtgct 
caactactac cgtggttcat gccccttacc 
gctatcctgg accaacatac cagggctacc 
gcagcaccct acccaacgca gtacccacca 
ccctaccatg agtccttggc tggagccagc 
tccctaaaga caattccctg aacctgcccc 
gtgagtgagt gatacgcaga gttctttact 
catgtggctt cctctgctgt tgaccaggta 
caacctgttt tcttcctcac ttgaaattgt 
aataaggtag gaggtatttc ccacgtcacc 
taggagagca agctttttgc gggtacagag 
gctaggcctt tattcccagg gtttggctgc 
caccaggtga caaggggact cagtggcagg 
tccatcagct gtctgtctgg atgtcactgt 
tgttcactat tcaggctcca ctgggggaat 
cccaggccag ggacctcggt ctgtctacta 
ttactgttag ccaaacattt tgcctgtttt 
ggttgaaatc cctggttcct ggaggacaga 
gacaccatct tcatagaaat acctgactcc 
gggacaccaa ggccaatggg ttatctggac 
ctccggccca gatgtgaata cctccatatt 
tagctttgtg ttgattggtg tttctgagca 
tgcaacattg gaaaaaaaaa aaaaaaaaaa 



gcggcaccgt ggtcttgggc ttggtccgtc 60 
tggctgcgcc ggcgccctct ctgtggaccc 120 
ctccgggtgc ccatggtgag ctgtgcaggc . 180 
tctgtcctga tttctgttgt ggttcctgtt 240 
ggaaaatcca gtggaatgag gaaatgtgtc 300 
ccgcggagga gacacccgaa catctgggtt 360 
gtgaccctat gtcagggttc ggagcgaccg 420 
ttattgccac tatcatcatc tgcttcacct 480 
gcccccaacg ccctgtcgtg accaacacca 54 0 
ctcagcctca acctcaacct gtggccccca 600 
atcccatgcc ccccccagcc aggaatgcca 660 
ccctacctgg cccagcccac agggccgcca 72 0 
cagcctccat acaacccgac ctacatggat 780 
cagcctcttt ggctgccatt tatgtcgtgt 840 
gctgtctgtg gtgtgtgtgc cttgtctaga 900 
ggcgcaagtc ttaccagtgt gggtcgggac 960 
actttctgaa atttcaagca aattaaaaac 1020 
ccaaggtgac cagccatggc ctgtcatact 1080 
caggctttgg ggggtaacca gctagctgct 1140 
attggcagtg aggcaggtgg ctgggggtga 1200 
gggtcacacc aggcagaaca ccatacactc 1260 
ccttcccggg gctgtataga gggccacatg 1320 
tttcctacct ttgctggctt ggctcctgct 1380 
cacactctgg tttctccctg cactgtcttt 1440 
ctgtctccag atgtgtgata attggtgtga 1500 
caacctgacc tccgactgtc agtttccctt 1560 
tgtaccacag tccagtttgt cccagtagca 1620 
caaaggtggg gtggagggcc tagatggtat 168 0 
ccctgttggt tcctgtttca ctggctgttt 1740 
ttcagactcc gcaccctcat ttctaataaa 1800 
aaaaaaaaaa aaaaaaaagg gcggccgc 1858 



<210> 57 

<211> 639 

<212> DNA 

<213> Mus musculus 



<400> 57 

atggctgcgc cggcgccctc tctgtggacc 
cctccgggtg cccatggtga gctgtgcagg 
ttctgtcctg atttctgttg tggttcctgt 
aggaaaatcc agtggaatga ggaaatgtgt 
cccgcggagg agacacccga acatctgggt 



ctattgctgc tgctgttgct gctgccgccg 60 

ccctttggtg aagacaattc gatcccagtg 120 

tccaaccaat actgctgctc ggacgtgctg 180 

cctgagccag agtccagcag attttccacc 240 

tcagcgctga aatttcgatc cagttttgac 300 
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agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360 

tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 4 80 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 63 9 

<210> 58 

<211> 213 

<212> PRT 

<213> Mus musculus 

<400> 58 

Met Ala Ala Pro Ala Pro Ser Leu Trp Thr Leu Leu Leu Leu Leu Leu 
15 10 15 

Leu Leu Pro Pro Pro Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe 
20 25 30 

Gly Glu Asp Asn Ser He Pro Val Phe Cys Pro Asp Phe Cys Cys Gly 
35 40 45 

Ser Cys Ser Asn Gin Tyr Cys Cys Ser Asp Val Leu Arg Lys He Gin 
50 55 60 

Trp Asn Glu Glu Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Thr 
65 70 75 80 

Pro Ala Glu Glu Thr Pro Glu His Leu Gly Ser Ala Leu Lys Phe Arg 
85 90 95 

Ser Ser Phe Asp Ser Asp Pro Met Ser Gly Phe Gly Ala Thr Val Ala 
100 105 110 

He Gly Val Thr He Phe Val Val Phe He Ala Thr He He He Cys 
115 120 125 

Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gin Arg 
130 135 140 

Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Val His Ala Pro Tyr 
145 150 155 160 

Pro Gin Pro Gin Pro Gin Pro Val Ala Pro Ser Tyr Pro Gly Pro Thr 
165 170 175 

Tyr Gin Gly Tyr His Pro Met Pro Pro Pro Ala Arg Asn Ala Ser Ser 
180 185 190 

Thr Leu Pro Asn Ala Val Pro Thr Thr Leu Pro Gly Pro Ala His Arg 
195 200 205 

Ala Ala Thr Leu Pro 
210 



<210> 59 
<211> 26 
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<212> PRT 

<213> Mus musculus 

<400> 59 

Met Ala Ala Pro Ala Pro Ser Leu 
1 5 

Leu Leu Pro Pro Pro Pro Gly Ala 
20 



Trp Thr Leu Leu Leu Leu Leu Leu 
10 15 

His Gly 
25 



<210> 60 

<211> 187 

<212> PRT 

<213> Mus musculus 

<400> 60 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser lie Pro Val Phe Cys 
15 10 15 

Pro Asp Phe Cys Cys Gly Ser Cys Ser Asn Gin Tyr Cys Cys Ser Asp 
20 25 30 

Val Leu Arg Lys He Gin Trp Asn Glu Glu Met Cys Pro Glu Pro Glu 
35 40 45 

Ser Ser Arg Phe Ser Thr Pro Ala Glu Glu Thr Pro Glu His Leu Gly 
50 55 60 

Ser Ala Leu Lys Phe Arg Ser Ser Phe Asp Ser Asp Pro Met Ser Gly 
65 70 75 80 

Phe Gly Ala Thr Val Ala He Gly Val Thr He Phe Val Val Phe He 
85 90 95 

Ala Thr He He He Cys Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys 
100 105 HO 

Met Cys Cys Pro Gin Arg Pro Val Val Thr Asn Thr Thr Thr Thr Thr 
115 120 125 

Val Val His Ala Pro Tyr Pro Gin Pro Gin Pro Gin Pro Val Ala Pro 
130 135 140 

Ser Tyr Pro Gly Pro Thr Tyr Gin Gly Tyr His Pro Met Pro Pro Pro 
145 150 155 160 

Ala Arg Asn Ala Ser Ser Thr Leu Pro Asn Ala Val Pro Thr Thr Leu 
165 170 175 

Pro Gly Pro Ala His Arg Ala Ala Thr Leu Pro 
180 185 



<210> 61 

<211> 86 

<212> PRT 

<213> Mus musculus 
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<400> 61 

Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn Ser lie Pro Val Phe Cys 
1 * 5 10 15 

Pro Asp Phe Cys Cys Gly Ser Cys Ser Asn Gin Tyr Cys Cys Ser Asp 
20 25 30 

Val Leu Arg Lys He Gin Trp Asn Glu Glu Met Cys Pro Glu Pro Glu 
35 40 45 

Ser Ser Arg Phe Ser Thr Pro Ala Glu Glu Thr Pro Glu His Leu Gly 
50 55 60 

Ser Ala Leu Lys Phe Arg Ser Ser Phe Asp Ser Asp Pro Met Ser Gly 
65 70 75 80 

Phe Gly Ala Thr Val Ala 
85 



<210> 62 
<211> 25 
<212> PRT 

<213> Mus musculus 
<400> 62 

He Gly Val Thr He Phe Val Val Phe He Ala Thr He He He Cys 
15 10 15 

Phe Thr Cys Ser Cys Cys Cys Leu Tyr 
20 25 



<210> 63 

<211> 76 

<212> PRT 

<213> Mus 



musculus 



<400> 63 

Lys Met Cys Cys Pro Gin Arg Pro Val Val Thr Asn Thr Thr Thr Thr 
! 5 10 15 

Thr Val Val His Ala Pro Tyr Pro Gin Pro Gin Pro Gin Pro Val Ala 
20 25 30 

Pro Ser Tyr Pro Gly Pro Thr Tyr Gin Gly Tyr His Pro Met Pro Pro 
35 40 45 

Pro Ala Arg Asn Ala Ser Ser Thr Leu Pro Asn Ala Val Pro Thr Thr 
50 55 60 

Leu Pro Gly Pro Ala His Arg Ala Ala Thr Leu Pro 
65 70 75 



<210> 64 
<211> 50 
<212> PRT 
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<213> Mus musculus 



<400> 64 

Cys Pro Asp Phe Cys Cys Gly Ser 
1 5 

Asp Val Leu Arg Lys He Gin Trp 
20 

Glu Ser Ser Arg Phe Ser Thr Pro 
35 40 



Cys Ser Asn Gin Tyr Cys Cys Ser 
10 15 

Asn Glu Glu Met Cys Pro Glu Pro 
25 30 

Ala Glu Glu Thr Pro Glu His Leu 
45 



Gly Ser 
50 



<210> 65 

<211> 56 

<212> PRT 

<213> Mus musculus 



<400> 65 

Cys Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gin 
15 10 15 

Arg Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Val His Ala Pro 
20 25 30 

Tyr Pro Gin Pro Gin Pro Gin Pro Val Ala Pro Ser Tyr Pro Gly Pro 
35 40 45 



Thr Tyr Gin Gly Tyr His Pro Met 
50 55 



<210> 66 

<211> 1927 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> modif ied_base 

<222> all "n" positions 

<223> n=a, c, g, or t 



<400> 66 

ccccgcctcc aaagctaacc ctcgggcttg 
actctggcac cactctccag gctgccatgg 
ttttgtcatg gtcgggaccc ctccaaggac 
gccgactagc tgctttagag gaacggctgg 
ctgctgagct gcgggacttc aagaacaaga 
gagcgggagg cactcagaac tgaggccgac 
cgggaggtag actatctgga gacccagaac 
aaggttgact ggaggccctg ggaccaaagg 
ggtgacagac tgtggctaca caatctctca 
tggtggccca gctggtctat ggaccaagga 
gttagatggg acacagaatg acacagcctt 
tgccatggct gcccggaaag cttcccgagt 



aggggaagan gctgactgta cgttccttct 60 
ggcccagcac ccctctcctc atcttgttcc 120 
agcagcacca ccttgtggag tacatggaac 180 
cccagtgcca ggaccagagt agtcggcatg 240 
tgctngccac tgctggaggt ggcagagaag 300 
accatctccg ggagagtgga tcgtctggag 3 60 
ccagctctgc cctgtgtaga gtttgatgag 420 
caagggaaga aggaatgaga agtacgatat 4 80 
agtgagatca atgaagattc tgaagcgatt 540 
tccactgggg caaacagaga agatctacgt 600 
tgtcttccca aggctgcgtg acttcaccct 660 
ccgggtgccc ttcccctggg taggcacagg 720 
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gcagctggta tatggtggct ttctttattt tgctcggagg cctcctggaa gacctggtgg 780 

aggtggtgag atggagaaca ctttgcagct aatcaaattc cacctggcaa accgaacagt 84 0 

ggtggacagc tcagtattcc cagcagaggg gctgatcccc ccctacggct tgacagcaga 900 

cacctacatc gacctggcag ctgatgagga aggtctttgg gctgtctatg ccacccggga 960 

ggatgacagg cacttgtgtc tggccaagtt agatccacag acactggaca cagagcagca 1020 

gtgggacaca ccatgtccca gagagaatgc tgaggctgcc tttntcatct gtgggaccct 1080 

ctatgtcgtc tataacaccc gtcctgccag tcgggcccgc atccagtgct cctttgatgc 1140 

cagcggaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatcgctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 162 0 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctcaggaa caggactttc 174 0 

tccacattgt tttgtattgc aacattttgc attaaaaagg aaaatccana aaaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1860 

aaactgcggc cgctgtccct tctgtcgtct tctcgcagcc gtacccttct gtcgtcttct 192 0 

cgcagcc 1927 

<210> 67 
<211> 319 
<212> PRT 

<213> Homo sapiens 
<400> 67 

Met Val Gly Lys Met Trp Pro Val Leu Trp Thr Leu Cys Ala Val Arg 

15 10 15 

Val Thr Val Asp Ala He Ser Val Glu Thr Pro Gin Asp Val Leu Arg 

20 25 30 

Ala Ser Gin Gly Lys Ser Val Thr Leu Pro Cys Thr Tyr His Thr Ser 

35 40 45 

Thr Ser Ser Arg Glu Gly Leu He Gin Trp Asp Lys Leu Leu Leu Thr 

50 55 60 

His Thr Glu Arg Val Val He Trp Pro Phe Ser Asn Lys Asn Tyr He 

65 70 75 80 

His Gly Glu Leu Tyr Lys Asn Arg Val Ser He Ser Asn Asn Ala Glu 

85 90 95 

Gin Ser Asp Ala Ser He Thr He Asp Gin Leu Thr Met Ala Asp Asn 

100 105 110 

Gly Thr Tyr Glu Cys Ser Val Ser Leu Met Ser Asp Leu Glu Gly Asn 

115 120 125 

Thr Lys Ser Arg Val Arg Leu Leu Val Leu Val Pro Pro Ser Lys Pro 

130 135 140 

Glu Cys Gly He Glu Gly Glu Thr He He Gly Asn Asn He Gin Leu 

145 150 155 160 



42 



WO 00/78808 



PCT/US00/16883 



Thr Cys Gin Ser Lys Glu Gly Ser Pro Thr Pro Gin Tyr Ser Trp Lys 
165 170 175 

Arg Tyr Asn lie Leu Asn Gin Glu Gin Pro Leu Ala Gin Pro Ala Ser 
180 185 190 

Gly Gin Pro Val Ser Leu Lys Asn lie Ser Thr Asp Thr Ser Gly Tyr 
195 200 205 

Tyr lie Cys Thr Ser Ser Asn Glu Glu Gly Thr Gin Phe Cys Asn lie 
210 215 220 

Thr Val Ala Val Arg Ser Pro Ser Met Asn Val Ala Leu Tyr Val Gly 
225 230 235 240 

He Ala Val Gly Val Val Ala Ala Leu He He He Gly He He He 
245 250 255 

Tyr Cys Cys Cys Cys Arg Gly Lys Asp Asp Asn Thr Glu Asp Lys Glu 
260 265 270 

Asp Ala Arg Pro Asn Arg Glu Ala Tyr Glu Glu Pro Pro Glu Gin Leu 
275 280 285 

Arg Glu Leu Ser Arg Glu Arg Glu Glu Glu Asp Asp Tyr Arg Gin Glu 
290 295 300 

Glu Gin Arg Ser Thr Gly Arg Glu Ser Pro Asp His Leu Asp Gin 
305 310 315 



<210> 68 

<211> 2793 

<212> DNA 

<213> Homo sapiens 



<400> 68 

ctaccccttt gtgagcagtc taggactttg tacacctgtt aagtagggag aaggcagggg 60 

aggtggctgg tttaagggga acttgaggga agtagggaag actcctcttg ggacctttgg 12 0 

agtaggtgac acatgagccc agccccagct cacctgccaa tccagctgag gagctcacct 180 

gccaatccag ctgaggctgg gcagaggtgg gtgagaagag ggaaaattgc agggacctcc 240 

agttgggcca ggccagaagc tgctgtagct ttaaccagac agctcagacc tgtctggagg 3 00 

ctgccagtga caggttaggt ttagggcaga gaagaagcaa gaccatggtg gggaagatgt 360 

ggcctgtgtt gtggacactc tgtgcagtca gggtgaccgt cgatgccatc tctgtggaaa 420 

ctccgcagga cgttcttcgg gcttcgcagg gaaagagtgt caccctgccc tgcacctacc 4 80 

acacttccac ctccagtcga gagggactta ttcaatggga taagctcctc ctcactcata 540 

cggaaagggt ggtcatctgg ccgttttcaa acaaaaacta catccatggt gagctttata 600 

agaatcgcgt cagcatatcc aacaatgctg agcagtccga tgcctccatc accattgatc 660 

agctgaccat ggctgacaac ggcacctacg agtgttctgt ctcgctgatg tcagacctgg 720 

agggcaacac caagtcacgt gtccgcctgt tggtcctcgt gccaccctcc aaaccagaat 7 80 

gcggcatcga gggagagacc ataattggga acaacatcca gctgacctgc caatcaaagg 84 0 

agggctcacc aacccctcag tacagctgga agaggtacaa catcctgaat caggagcagc 900 

ccctggccca gccagcctca ggtcagcctg tctccctgaa gaatatctcc acagacacat 960 

cgggttacta catctgtacc tccagcaatg aggaggggac gcagttctgc aacatcacgg 102 0 

tggccgtcag atctccctcc atgaacgtgg ccctgtatgt gggcatcgcg gtgggcgtgg 1080 

ttgcagccct cattatcatt ggcatcatca tctactgctg ctgctgccga gggaaggacg 114 0 

acaacactga agacaaggag gatgcaaggc cgaaccggga agcctatgag gagccaccag 12 00 

agcagctaag agaactttcc agagagaggg aggaggagga tgactacagg caagaagagc 1260 
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agaggagcac tgggcgtgaa tccccggacc acctcgacca gtgacaggcc agcagcagag 1320 
ggcggcggag gaagggttag gggttcattc tcccgcttcc tggcctccct tctcctttct 1380 
aagccctgtt ctcctgtccc tccatcccag acattgatgg ggacatttct tccccagtgt 1440 
cagctgtggg gaacatggct ggcctggtaa gggggtccct gtgctgatcc tgctgacctc 1500 
actgtcctgt gaagtaaccc ctcctggctg tgacacctgg tgcgggcctg gccctcactc 1560 
aagaccaggc tgcagcctcc acttccctcg tagttggcag gagctcctgg aagcacagcg 1620 
ctgagcatgg ggcgctccca ctcagaactc tccagggagg cgatgccagc cttggggggt 1680 
gggggctgtc ctgctcacct gtgtgcccag cacctggagg ggcaccaggt ggagggtttg 174 0 
cactccacac atctttcttg aatgaatgaa agaataagtg agtatgcttg ggccctgcat 1800 
tggcctggcc tccagctccc actccctttc caacctcact tcccgtagct gccagtatgt 1860 
tccaaaccct cctgggaagg ccacctccca ctcctgctgc acaggccctg gggagctttt 1920 
gcccacacac tttccatctc tgcctgtcaa tatcgtacct gtccctccag gcccatctca 1980 
aatcacaagg atttctctaa ccctatccta attgtccaca tacgtggaaa caatcctgtt 204 0 
actctgtccc acgtccaatc atgggccaca aggcacagtc ttctgagcga gtgctctcac 2100 
tgtattagag cgccagctcc ttggggcagg gcctgggcct catggctttt gctttccctg 2160 
aagccctagt agctggcgcc catcctagtg ggcacttaag cttaattggg gaaactgctt 222 0 
tgattggttg tgccttccct tctctggtct ccttgagatg atcgtagaca cagggatgat 2280 
tcccacccaa acccacgtat tcattcagtg agttaaacac gaattgattt aaagtgaaca 2340 
cacacaaggg agcttgcttg cagatggtct gagttcttgt gtcctggtaa ttcctctcca 2400 
ggccagaata attggcatgt ctcctcaacc cacatggggt tcctggttgt tcctgcatcc 2460 
cgatacctca gccctggccc tgcccagccc atttgggctc tggttttctg gtggggctgt 2520 
cctgctgccc tcccacagcc tccttctgtt tgtcgagcat ttcttctact cttgagagct 2580 
caggcagcgt tagggctgct taggtctcat ggaccagtgg ctggtctcac ccaactgcag 2640 
tttactattg ctatcttttc tggatgatca gaaaaataat tccataaatc tattgtctac 2700 
ttgcgatttt ttaaaaaatg tatattttta tatatattgt taaatccttt gcttcattcc 2760 
aaatgctttc agtaataata aaattgtggg tgg 2793 

<210> 69 
<211> 52 
<212> PRT 

<213> Homo sapiens 
<400> 69 

Lys Thr Ala Leu Gly Glu Leu Leu Lys Pro Leu Asn Ser Glu Tyr Gly 
15 10 15 

Lys Val Ala Pro Gly Trp Gly Thr Thr Pro Leu Met Gly Val Phe Met 
20 25 30 

Ala Leu Phe Ala Val Phe Leu Leu lie lie Leu Glu lie Tyr Asn Ser 
35 40 45 

Ser Val Leu Leu 
50 



<210> 70 

<211> 1832 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 

<222> all "n" positions 

<223> n=a, c, g, or t 

<400> 70 

tgtggctgac gtcatctgga ggagatttgc tttctttttc tccaaaaggg gaggaaattg 60 
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aaactgcagt ggcccacgat gggaagaggg 
tgaaggcaga ggctaacatg gggttcggag 
gctgtctgtc gtcactatca tcatctgctt 
gtgccgccga ccacgtccgg ttgtcaccac 
ttatcctcag cctccaagtg tgccgcccag 
caccatgccg cctcagccag ggatgccagc 
ttacccagcc cagcccatgg gcccaccggc 
cgcgccctam cccgscagcc agcctcctta 
ccctctgagc attccctggc ctctytggct 
gtggtgtgca ggcgcggttc cttacgcccc 
cttacgcccc atgtgtgctg tgtgtgtcct 
caagtgggga acaatccttg ccagagtggg 
tgaaattatg cttcctaaaa tctcaagcca 
ctgtgaggtg gcccctgaga ggtgggggcc 
gcttacccta gggtgaccaa gtagggcctg 
tgcagatgtg tcctggtttc ggcagcgtag 
ccggagttgg gggtacccgt tgcagagcca 
ggccaagttg gactttgatc ctttgggcag 
cctgttgggg atcaggcagc ctcctgatgc 
tgtacctgtc tgcctggact gtcccctgtc 
ccacatgcac acacagccta gctgccccca 
cttcccacag gtgagcaggg ctcctgtcca 
ttttcatttt attttagcca aacattttgc 
atgagactga aacccctggg ttgtggaggg 
ctgtgagtcc ctgcttcccg acaccagcct 
gtccacggtg ttctggcagc agggacacct 
ggtgtggggc cctggatggc agctctggcc 
tctattactg tttcaccaga gctgtcttag 
tctgtacact tgtttataat aaatgcaatc 
ggsggcgctc taaaaggatn ccccnaaggg 



gaaagcccag gggtacagga ggcctctggg 12 0 
cgaccttggc cgttggctga ccatctttgt 180 
cacctgctcc tgctgctgcc tttacaagac 240 
caccacatcc accactgtgg tgcatgcccc 300 
ctaccctgga ccaagctacc agggctacca 360 
agcaccctac ccaatgcagt acccaccacc 420 
ctaccacgag accctggctg gaggagcagc 4 80 
caacccggcc tacatggatg cccgaagcgg 540 
gccacttggt tatgttgtgt gtgtgcgtra 600 
atgtgtgctg tgtgtgtcca ggcacggttc 660 
gcctgtatat gtggcttcct ctgatgctga 720 
ctgggaccag actttgttct cttcctcacc 780 
aactcaaaga atggggtggt ggggggcacc 840 
tctccagggc acatctggag ttcttctcca 900 
tcacaccagg gtggcgcast ttctgtgtga 960 
ccagctgctg cttgaggcca tggctcgtcc 1020 
gggacatgat gcaggcgaag yttgggatct 1080 
atgtcccatt gctccctgga gcctgtcatg 114 0 
cagaacacct caggcagagc cctactcagc 1200 
cccgcatctc ccctgggacc agctggaggg 12 6 0 
gggagctctg ctgcccttgc tggccctgcc 1320 
ccagcacact cagttctctt ccctgcagtg 1380 
ctgttttctg tttcaaacat gatagttgat 1440 
aaattggctc agagatggac aacctggcaa 1500 
catggaatat gcaacaactc ctgtacccca 1560 
gggccaatgg gccatctgga ccaaaggtgg 1620 
cagacatgaa tacctcgtgt tcctcctccc 1680 
ctcaaatctg ttgtgtttct gagtctaggg 174 0 
gtttnggaaa aaaaananaa aaaaaaaagg 1800 
qq 1832 



<210> 71 
<211> 51 
<212> PRT 

<213> Mus musculus 



<400> 71 

Ser Pro Arg Ser Lys Pro Thr Ala 
1 5 

Pro Ser Ser Gin Val Phe Phe Gly 
20 

Ser Leu Lys Leu Thr Asn Leu Ser 
35 40 



Gin Tyr Gin Trp Glu Arg Leu Ala 

10 15 

Pro Ala Leu Asp Ala Val Arg Gly 
25 30 

He Ala Met Ser Gly Val Tyr Val 
45 



Cys Lys Ala 
50 



<210> 72 
<211> 2557 
<212> DNA 

<213> Homo sapiens 
<400> 72 

gaattccggg agaagtgacc agagcaattt ctgcttttca cagggcgggt ttctcaacgg 60 
tgacttgtgg gcagtgcctt ctgctgagcg agtcatggcc cgaaggcaga actaactgtg 120 
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cctgcagtct tcactctcag gatgcagccg aggtgggccc aaggggccac gatgtggctt 180 

ggagtcctgc tgacccttct gctctgttca agccttgagg gtcaagaaaa ctctttcaca 24 0 

atcaacagtg ttgacatgaa gagcctgccg gactggacgg tgcaaaatgg gaagaacctg 300 

accctgcagt gcttcgcgga tgtcagcacc acctctcacg tcaagcctca gcaccagatg 360 

ctgttctata aggatgacgt gctgttttac aacatctcct ccatgaagag cacagagagt 420 

tattttattc ctgaagtccg gatctatgac tcagggacat ataaatgtac tgtgattgtg 480 

aacaacaaag agaaaaccac tgcagagtac cagctgttgg tggaaggagt gcccagtccc 54 0 

agggtgacac tggacaagaa agaggccatc caaggtggga tcgtgagggt caactgttct 600 

gtcccagagg aaaaggcccc aatacacttc acaattgaaa aacttgaact aaatgaaaaa 660 

atggtcaagc tgaaaagaga gaagaattct cgagaccaga attttgtgat actggaattc 72 0 

cccgttgagg aacaggaccg cgttttatcc ttccgatgtc aagctaggat catttctggg 780 

atccatatgc agacctcaga atctaccaag agtgaactgg tcaccgtgac ggaatccttc 84 0 

tctacaccca agttccacat cagccccacc ggaatgatca tggaaggagc tcagctccac 900 

attaagtgca ccattcaagt gactcacctg gcccaggagt ttccagaaat cataattcag 960 

aaggacaagg cgattgtggc ccacaacaga catggcaaca aggctgtgta ctcagtcatg 1020 

gccatggtgg agcacagtgg caactacacg tgcaaagtgg agtccagccg catatccaag 1080 

gtcagcagca tcgtggtcaa cataacagaa ctattttcca agcccgaact ggaatcttcc 1140 

ttcacacatc tggaccaagg tgaaagactg aacctgtcct gctccatccc aggagcacct 1200 

ccagccaact tcaccatcca gaaggaagat acgattgtgt cacagactca agatttcacc 1260 

aagatagcct caaagtcgga cagtgggacg tatatctgca ctgcaggtat tgacaaagtg 1320 

gtcaagaaaa gcaacacagt ccagatagtc gtatgtgaaa tgctctccca gcccaggatt 1380 

tcttatgatg cccagtttga ggtcataaaa ggacagacca tcgaagtccg ttgcgaatcg 1440 

atcagtggaa ctttgcctat ttcttaccaa cttttaaaaa caagtaaagt tttggagaat 1500 

agtaccaaga actcaaatga tcctgcggta ttcaaagaca accccactga agacgtcgaa 1560 

taccagtgtg ttgcagataa ttgccattcc catgccaaaa tgttaagtga ggttctgagg 1620 

gtgaaggtga tagccccggt ggatgaggtc cagatttcta tcctgtcaag taaggtggtg 1680 

gagtctggag aggacattgt gctgcaatgt gctgtgaatg aaggatctgg tcccatcacc 1740 

tataagtttt acagagaaaa agagggcaaa cccttctatc aaatgacctc aaatgccacc 1800 

caggcatttt ggaccaagca gaaggctagc aaggaacagg agggagagta ttactgcaca 1860 

gccttcaaca gagccaacca cgcctccagt gtccccagaa gcaaaatact gacagtcaga 1920 

gtcattcttg ccccatggaa gaaaggactt attgcagtgg ttatcatcgg agtgatcatt 1980 

gctctcttga tcattgcggc caaatgttat tttctgagga aagccaaggc caagcagatg 2040 

ccagtggaaa tgtccaggcc agcagtacca cttctgaact ccaacaacga gaaaatgtca 2100 

gatcccaata tggaagctaa cagtcattac ggtcacaatg acgatgtcag aaaccatgca 2160 

atgaaaccaa taaatgataa taaagagcct ctgaactcag acgtgcagta cacggaagtt 2220 

caagtgtcct cagctgagtc tcacaaagat ctaggaaaga aggacacaga gacagtgtac 2280 

agtgaagtcc ggaaagctgt ccctgatgcc gtggaaagca gatactctag aacggaaggc 2340 

tcccttgatg gaacttagac agcaaggcca gatgcacatc cctggaagga catccatgtt 24 00 

ccgagaagaa cagataatcc ctgtatttca agacctctgt gcacttattt atgaacctgc 2460 

cctgctccca cagaacacag caattcctca ggctaagctg ccggttctta aatccatcct 2520 

gctaagttaa tgttgggtag aaagagatac agagggg 2557 

<210> 73 

<211> 738 

<212> PRT 

<213> Homo sapiens 

<400> 73 

Met Gin Pro Arg Trp Ala Gin Gly Ala Thr Met Trp Leu Gly Val Leu 
15 10 15 

Leu Thr Leu Leu Leu Cys Ser Ser Leu Glu Gly Gin Glu Asn Ser Phe 
20 25 30 

Thr He Asn Ser Val Asp Met Lys Ser Leu Pro Asp Trp Thr Val Gin 
35 40 45 

Asn Gly Lys Asn Leu Thr Leu Gin Cys Phe Ala Asp Val Ser Thr Thr 
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50 



55 



60 



Ser His Val Lys Pro Gin His Gin Met Leu Phe Tyr Lys Asp Asp Val 
65 70 75 80 

Leu Phe Tyr Asn lie Ser Ser Met Lys Ser Thr Glu Ser Tyr Phe He 
85 90 95 

Pro Glu Val Arg He Tyr Asp Ser Gly Thr Tyr Lys Cys Thr Val He 
100 105 110 

Val Asn Asn Lys Glu Lys Thr Thr Ala Glu Tyr Gin Leu Leu Val Glu 
115 120 125 

Gly Val Pro Ser Pro Arg Val Thr Leu Asp Lys Lys Glu Ala He Gin 
130 135 140 

Gly Gly He Val Arg Val Asn Cys Ser Val Pro Glu Glu Lys Ala Pro 
145 150 155 160 

He His Phe Thr He Glu Lys Leu Glu Leu Asn Glu Lys Met Val Lys 
165 170 175 

Leu Lys Arg Glu Lys Asn Ser Arg Asp Gin Asn Phe Val He Leu Glu 
180 185 190 

Phe Pro Val Glu Glu Gin Asp Arg Val Leu Ser Phe Arg Cys Gin Ala 
195 200 205 

Arg He He Ser Gly He His Met Gin Thr Ser Glu Ser Thr Lys Ser 
210 215 220 

Glu Leu Val Thr Val Thr Glu Ser Phe Ser Thr Pro Lys Phe His He 
225 230 235 240 

Ser Pro Thr Gly Met He Met Glu Gly Ala Gin Leu His He Lys Cys 
245 250 255 

Thr He Gin Val Thr His Leu Ala Gin Glu Phe Pro Glu He He He 
260 265 270 

Gin Lys Asp Lys Ala He Val Ala His Asn Arg His Gly Asn Lys Ala 
275 280 285 

Val Tyr Ser Val Met Ala Met Val Glu His Ser Gly Asn Tyr Thr Cys 
290 295 300 

Lys Val Glu Ser Ser Arg He Ser Lys Val Ser Ser He Val Val Asn 
305 310 315 320 

He Thr Glu Leu Phe Ser Lys Pro Glu Leu Glu Ser Ser Phe Thr His 
325 330 335 



Leu Asp Gin Gly Glu Arg Leu Asn Leu Ser Cys Ser He Pro Gly Ala 

340 345 350 

Pro Pro Ala Asn Phe Thr He Gin Lys Glu Asp Thr He Val Ser Gin 

355 360 365 
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Thr Gin Asp Phe Thr Lys He Ala Ser Lys Ser Asp Ser Gly Thr Tyr 
370 375 380 

He Cys Thr Ala Gly He Asp Lys Val Val Lys Lys Ser Asn Thr Val 
385 390 395 400 

Gin He Val Val Cys Glu Met Leu Ser Gin Pro Arg He Ser Tyr Asp 
405 410 415 

Ala Gin Phe Glu Val He Lys Gly Gin Thr He Glu Val Arg Cys Glu 
420 425 430 

Ser He Ser Gly Thr Leu Pro He Ser Tyr Gin Leu Leu Lys Thr Ser 
435 440 445 

Lys Val Leu Glu Asn Ser Thr Lys Asn Ser Asn Asp Pro Ala Val Phe 
450 455 460 

Lys Asp Asn Pro Thr Glu Asp Val Glu Tyr Gin Cys Val Ala Asp Asn 
465 470 475 480 

Cys His Ser His Ala Lys Met Leu Ser Glu Val Leu Arg Val Lys Val 
485 490 495 

He Ala Pro Val Asp Glu Val Gin He Ser He Leu Ser Ser Lys Val 
500 505 510 

Val Glu Ser Gly Glu Asp He Val Leu Gin Cys Ala Val Asn Glu Gly 
515 520 525 

Ser Gly Pro He Thr Tyr Lys Phe Tyr Arg Glu Lys Glu Gly Lys Pro 
530 535 540 

Phe Tyr Gin Met Thr Ser Asn Ala Thr Gin Ala Phe Trp Thr Lys Gin 
545 550 555 560 

Lys Ala Ser Lys Glu Gin Glu Gly Glu Tyr Tyr Cys Thr Ala Phe Asn 
565 570 575 

Arg Ala Asn His Ala Ser Ser Val Pro Arg Ser Lys He Leu Thr Val 
580 585 590 

Arg Val He Leu Ala Pro Trp Lys Lys Gly Leu He Ala Val Val He 
595 600 605 

He Gly Val He He Ala Leu Leu He He Ala Ala Lys Cys Tyr Phe 
610 615 620 

Leu Arg Lys Ala Lys Ala Lys Gin Met Pro Val Glu Met Ser Arg Pro 
625 630 635 640 

Ala Val Pro Leu Leu Asn Ser Asn Asn Glu Lys Met Ser Asp Pro Asn 
645 650 655 



Met Glu Ala Asn Ser His Tyr Gly His Asn Asp Asp Val Arg Asn His 
660 665 670 

Ala Met Lys Pro He Asn Asp Asn Lys Glu Pro Leu Asn Ser Asp Val 
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675 680 685 

Gin. Tyr Thr Glu Val Gin Val Ser Ser Ala Glu Ser His Lys Asp Leu 
690 695 700 

Gly Lys Lys Asp Thr Glu Thr Val Tyr Ser Glu Val Arg Lys Ala Val 
705 710 715 720 

Pro Asp Ala Val Glu Ser Arg Tyr Ser Arg Thr Glu Gly Ser Leu Asp 
725 730 735 



Gly Thr 



<210> 74 
<211> 601 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 



<400> 74 

gnnnnnnagg tntanagncn cctttacncc 
gctgtggagc aagaagcaac ccgaagctag 
gttggggtag gagtgggagc agggccagca 
cagctgggag agctggggag ccgggaaggg 
ctgggcctcc tgggtcatca ccatgaggcc 
atcaggctct cctcctctgg acgacaacaa 
cctcccaggc acaccaggcc accacggcag 
tggccgcgac ggtgcacccg gagctccggg 
acctgggcca cgtngggagc ccgggccgcg 
gcctggnggg gaatgctcgg tgccccacga 
c 



gccgcggacg cgtgggcgga cgcgtggggt 60 
gagtctgtca gcgagggcag gggctgcctg 120 
ggagggtctg aggaagccat tcaaagcgag 180 
cctacagact acaagagagg atcctggcgt 24 0 
acttcttgcc ctgctgcttc tgggtctggc 300 
gatccccagc ctgtgtcccg ggcagcccgg 360 
ccaaggcctg cctggccgtg acggccgtga 420 
agagaaaggc gagggcggga gaccgggact 480 
tggagaggca ggacctgtgg gggctatcgg 54 0 
tcagcttcag tgccaagcga tcagaaagcc 600 

601 



<210> 75 
<211> 732 
<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 75 

gngngttnnn ttccncctcc gacttaaggc 
cttcttcctt ttgtcatggc cgggacccct 
catggaacgc cgactagctg ccttagagga 
tcggcatgct gctgagcttc gggacttcaa 
agagaaggag cgggaaacac tcagaaccga 
tcttgaacgg gaagtagact acctggagac 
ggatgagaag gtgactggag gccctggaac 
cgatatggtg acagactgta gctacacaat 
gcggtttggt ggctcagctg gcctatggac 



tgccatgggg cccagtgctc ctctgctcct 60 
tcagggacag cagcaccacc ttgtggagta 12 0 
gcggctggca cagtgccagg atcagagcag 180 
aaacaagatg ctgcctctac tggaggtggc 24 0 
ggcagacagc atttcaggaa gagtggaccg 300 
acagaaccca gctttgccct gtgtagaact 360 
caaaggcaag ggccggagaa atgagaaata 420 
ctctcaggtg aggtcaatga agatcctgaa 480 
caaggatcca ctggggccag canagaagat 54 0 
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ctacgtgtta gacggnacgc agaacgacac 

ccctcaccat ggctgccgca aagttccgaa 

aactggtgtn tgtggcttcc tttttatctc 
nggtggnaaa at 



ggccttcgtt ttccganggt gcgtgactta 600 
tcgggtgccc ttncctgggt agnacaagaa 660 
aangcntctg gaggaacttg nanggggggn 720 

732 



<210> 76 

<211> 177 

<212> PRT 

<213> Homo sapiens 



<400> 76 



Gin 


Leu Gin 


Leu His 


Leu 


Pro 


Ala 


Asn 


Arg 


Leu 


Gin 


Ala 


val 


Glu 


Glu 


1 




5 










10 










15 




Gly Glu Ser 


Gly Ala 


Ser 


Ala 


Trp 


Tyr 


Thr 


Leu 


His 


Arg 


Glu 


Val 


Ser 






20 








25 










30 






Ser 


Ser Gin 
35 


Pro Trp 


Glu 


Val 


Pro 
40 


Phe 


Val 


Met 


Trp 


Phe 
45 


Phe 


Lys 


Gin 


Lys 


Glu Lys 
50 


Glu Asp 


Gin 


Val 
55 


Leu 


Ser 


Tyr 


He 


Asn 
60 


Gly 


Val 


Thr 


Thr 


Ser 


Lys Pro 


Gly Val 


Ser 


Leu 


Val 


Tyr 


Ser 


Met 


Pro 


Ser 


Arg 


Asn 


Leu 


65 




70 










75 










80 


Ser 


Leu Arg 


Val Glu 


Gly 


Leu 


Gin 


Glu 


Lys 


Asp 


Ser 


Gly 


Pro 


Tyr 


Ser 




85 










90 










95 




Cys 


Ser Val 


Asn Val 


Gin 


Asp 


Lys 


Gin 


Gly 


Lys 


Ser 


Arg 


Gly 


His 


Ser 




100 








105 










110 






He 


Lys Thr 
115 


Leu Glu 


Leu 


Asn 


Val 
120 


Leu 


Val 


Pro 


Pro 


Ala 
125 


Pro 


Pro 


Ser 


Cys 


Arg Leu 
130 


Gin Gly 


Val 


Pro 
135 


His 


Val 


Gly 


Ala 


Asn 
140 


Val 


Thr 


Leu 


Ser 


Cys 


Gin Ser 


Pro Arg 


Ser 


Lys 


Pro 


Ala 


Val 


Gin Tyr 


Gin 


Trp 


Asp 


Arg 


145 






150 










155 










160 


Gin 


Leu Pro 


Ser Phe 
165 


Gin 


Thr 


Phe 


Phe 


Ala 
170 


Pro 


Ala 


Leu 


Asp 


Val 
175 


He 



Arg 



<210> 77 

<211> 735 

<212> DNA 

<213> Homo sapiens 

<400> 77 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 6 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 12 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 18 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 24 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 3 0 

ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 36 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 42 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 48 

accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 54 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 60 

gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 66 

tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 72 

ccaaaggtag ttccc 73 

<210> 78 
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<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 78 

Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr 

15 10 15 

Val Cys 



<210> 79 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 79 

Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val Thr Leu Glu 

15 10 15 

Val Ser Thr Gly Pro Gly 
20 

<210> 80 
<211> 728 
<212> DNA 

<213> Homo sapiens 
<400> 80 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 
gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120 
catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180 
gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 24 0 
ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 300 
ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 36 0 

cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 
gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 4 80 

gggccagcct gcagtttgat ctggtgaaga atggcgaatc ccttgcctct ttcttccagt 540 
ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 
ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 
tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 
tctttgct 728 

<210> 81 

<211> 206 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 
<400> 81 

Met lie Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu 

15 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gin Leu Gin 

20 25 30 

Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu Gly Glu Ser 
35 40 45 
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Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser Ser Ser Gin 

50 55 60 

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin Lys Glu Lys 
65 70 75 80 

Glu Asp Gin Val Leu Ser Tyr lie Asn Gly Val Thr Thr Ser Lys Pro 

85 90 95 

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg 

100 105 HO 

Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val 

115 120 125 

Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser lie Lys Thr 

130 135 140 

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu 
145 150 155 160 

Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gin Ser 

165 170 175 

Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro 

180 185 190 

Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He Arg 
195 200 205 

<210> 82 
<211> 217 
<212> PRT 

<213> Homo sapiens 
<400> 82 

Gin Leu Gin Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu 

15 10 15 

Gly Glu Ser Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser 

20 25 30 

Ser Ser Gin Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin 

35 40 45 

Lys Glu Lys Glu Asp Gin Val Leu Ser Tyr He Asn Gly Val Thr Thr 

50 55 60 

Ser Lys Pro Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu 
65 70 75 80 

Ser Leu Arg Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser 

85 90 95 

Cys Ser Val Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser 

100 105 HO 

He Lys Thr Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser 

115 120 125 

Cys Arg Leu Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser 

130 135 140 

Cys Gin Ser Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg 
145 150 155 160 

Gin Leu Pro Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He 

165 170 175 

Arg Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val 

180 185 190 

Tyr Val Cys Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val 

195 200 205 

Thr Leu Glu Val Ser Thr Gly Pro Gly 
210 215 

<210> 83 
<211> 220 
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<212> PRT 

<213> Homo sapiens 

<400> 83 

Gin Leu Gin Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu 

1 5 10 15 

Gly Glu Ser Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Val Ser 

20 25 30 

Ser Ser Gin Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin 

35 40 45 

Lys Glu Lys Glu Asp Gin Val Leu Ser Tyr lie Asn Gly Val Thr Thr 

50 55 60 

Ser Lys Pro Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu 
65 70 75 80 

Ser Leu Arg Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser 

85 90 95 

Cys Ser Val Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser 

100 105 HO 

He Lys Thr Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser 

115 120 12 5 

Cys Arg Leu Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser 

130 135 140 

Cys Gin Ser Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg 
145 150 155 160 

Gin Leu Pro Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He 

165 170 175 

Arq Gly Ser Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val 

180 185 190 

Tyr Val Cys Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val 

195 200 205 

Thr Leu Glu Val Ser Thr Gly Pro Gly Ala Ala Val 
210 215 220 

<210> 84 

<211> 202 

<212> PRT 

<213> Homo sapiens 

<400> 84 

Met Gly Pro Ser Thr Pro Leu Leu He Leu Phe Leu Leu Ser Trp Ser 

1 5 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 " 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys 
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145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys lie Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Val Arg 
195 200 



<210> 85 
<211> 67 
<212> PRT 

<213> Homo sapiens 



<400> 


85 












Ala 


Ser 


Arg 


Ala 


Arg 


He Gin Cys 


1 








5 






Thr 


Pro 


Glu 


Arg 


Ala 


Ala Leu 


Pro 








20 








His 


Ala 


Ser 


Leu Arg Tyr Asn 


Pro 






35 








40 


Asp 


Asp 


Gly 


Tyr 


Gin 


He Val 


Tyr 




50 








55 




Glu 


Glu 


Val 










65 















Ser Phe Asp Ala Ser Gly Thr Leu 

10 15 
Tyr Phe Pro Arg Arg Tyr Gly Ala 
25 30 
Arg Glu Arg Gin Leu Tyr Ala Trp 
45 

Lys Leu Glu Met Arg Lys Lys Glu 
60 



<210> 86 

<211> 19 

<212> PRT 

<213> Homo sapiens 



<400> 86 

Val Pro Phe Pro Trp Val. Gly Thr 

1 5 
Leu Tyr Phe 



Gly Gin Leu Val Tyr Gly Gly Phe 
10 15 



<210> 87 
<211> 17 
<212> PRT 

<213> Homo sapiens 



<400> 87 

Ala Glu Ala Ala Phe Val He Cys 

1 5 
Thr 



Gly Thr Leu Tyr Val Val Tyr Asn 
10 15 



<210> 88 
<211> 99 
<212> PRT 
<213> Homo 



sapiens 



<400> 88 

Ala Arg Arg Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn 

! ^ 5 10 I 5 

Thr Leu Gin Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp 

20 25 30 

Ser Ser Val Phe Pro Ala Glu Gly Leu He Pro Pro Tyr Gly Leu Thr 
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35 40 45 

Ala Asp Thr Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala 

50 55 60 

Val Tyr Ala Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu 
65 70 75 80 

Asp Pro Gin Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro 
85 90 95 

Arg Glu Asn 



<210> 89 
<211> 320 
<212> PRT 

<213> Homo sapiens 
<400> 89 

Met Gly Pro Ser Thr Pro Leu Leu He Leu Phe Leu Leu Ser Trp Ser 

15 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg 

210 215 220 

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin 
225 J 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 255 - 

Phe Pro Ala Glu Gly Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

275 280 285 

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 ' 310 315 320 

<210> 90 
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<211> 385 
<212> PRT 
<213> Homo sapiens 



<400> 90 



Gin Gin His His 


Leu 


Val 


Glu 


Tyr 


1 


5 








Glu Glu Arg Leu 


Ala 


Gin 


Cys 


Gin 


20 










Glu Leu Arg Asp 


Phe 


Lys 


Asn 


Lys 


35 








40 


Glu Lys Glu Arg 


Glu 


Ala 


Leu 


Arg 


50 






55 




Arg Val Asp Arg 


Leu Glu Arg 


Glu 


65 




70 






Pro Ala Leu Pro 


Cys 


Val 


Glu 


Phe 




85 








Gly Thr Lys Gly 


Lys 


Gly Arg 


Arg 


100 










Asp Cys Gly Tyr 


Thr 


He 


Ser 


Gin 


115 








120 


Arg Phe Gly Gly 


Pro 


Ala 


Gly 


Leu 


130 






135 




Thr Glu Lys lie 


Tyr Val 


Leu 


Asp 


145 




150 






Val Phe Pro Arg 


Leu Arg Asp 


Phe 




165 








Ala Ser Arg Val 


Arg 


Val 


Pro 


Phe 


180 










Val Tyr Gly Gly 


Phe 


Leu 


Tyr 


Phe 


195 








200 


Gly Gly Gly Gly 


Glu 


Met 


Glu 


Asn 


210 






215 




Leu Ala Asn Arg 


Thr 


Val 


Val 


Asp 


225 




230 






Leu lie Pro Pro 


Tyr Gly 


Leu 


Thr 




245 








Ala Asp Glu Glu 


Gly Leu Trp 


Ala 


260 










Arg His Leu Cys 


Leu 


Ala 


Lys 


Leu 


275 








280 


Gin Gin Trp Asp 


Thr 


Pro 


Cys 


Pro 


290, 






295 




Val lie' Cys Gly 


Thr 


Leu 


Tyr 


Val 


305 




310 






Arg Ala Arg lie 


Gin 


Cys 


Ser 


Phe 




325 








Glu Arg Ala Ala 


Leu 


Pro Tyr 


Phe 


340 










Ser Leu Arg Tyr 


Asn 


Pro 


Arg 


Glu 


355 








360 


Gly Tyr Gin He 


Val 


Tyr Lys 


Leu 


370 






375 





Val 



385 

<210> 91 
<211> 728 



Met 


Glu 


Arg 


Arg 


Leu 


Ala 


Ala 


Leu 




10 










15 




Asp 


Gin 


Ser 


Ser 


Arg 


His 


Ala 


Ala 


25 










30 






Met 


Leu 


Pro 


Leu 


Leu 


Glu 


Val 


Ala 










45 








Thr 


Glu 


Ala 


Asp 


Thr 


lie 


Ser 


Gly 








60 










Val 


Asp Tyr 


Leu 


Glu 


Thr 


Gin 


Asn 






75 










80 


Asp 


Glu 


Lys 


Val 


Thr 


Gly 


Gly 


Pro 




90 










95 




Asn 


Glu 


Lys 


Tyr 


Asp 


Met 


Val 


Thr 


105 










110 






Val 


Arg Ser 


Met 


Lys 


He 


Leu 


Lys 










125 








Trp 


Thr 


Lys 


Asp 


Pro 


Leu 


Gly 


Gin 








140 










Gly 


Thr 


Gin 


Asn 


Asp 


Thr 


Ala 


Phe 






155 










160 


Thr 


Leu 


Ala 


Met 


Ala 


Ala 


Arg 


Lys 




170 










175 




Pro 


Trp 


Val 


Gly 


Thr 


Gly 


Gin 


Leu 


185 










190 






Ala 


Arg 


Arg 


Pro 


Pro 


Gly 


Arg 


Pro 










205 








Thr 


Leu 


Gin 


Leu 


He 


Lys 


Phe 


His 








220 










Ser 


Ser 


Val 


Phe 


Pro 


Ala 


Glu 


Gly 






235 










240 


Ala 


Asp Thr 


Tyr 


He 


Asp 


Leu 


Ala 




250 










255 




Val 


Tyr Ala 


Thr 


Arg 


Glu 


Asp 


Asp 


265 










270 






Asp 


Pro 


Gin 


Thr 


Leu 


Asp 


Thr 


Glu 










285 








Arg 


Glu 


Asn 


Ala 


Glu 


Ala 


Ala 


Phe 








300 










Val 


Tyr 


Asn 


Thr 


Arg 


Pro 


Ala 


Ser 






315 










320 


Asp 


Ala 


Ser 


Gly 


Thr 


Leu 


Thr 


Pro 




330 










335 




Pro 


Arg Arg 


Tyr 


Gly 


Ala 


His 


Ala 


345 










350 






Arg 


Gin 


Leu 


Tyr 


Ala 


Trp 


Asp 


Asp 










365 








Glu 


Met Arg 


Lys 


Lys 


Glu 


Glu 


Glu 



380 
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<212> DNA 

<213> Homo sapiens 



<400> 91 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 
gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 
catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg tgcgcccggg 180 
gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240 
ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 300 
ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 
cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 
gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 
gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 
ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 
ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 
tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 



tctttgct 

<210> 92 
<211> 69 
<212> PRT 

<213> Homo sapiens 

< 400> 92 _ r^l 

Arg Pro Ala Ser Arg Ala Arg lie Gin Cys Ser Phe Asp Ala Ser Gly 

15 10 15 

Thr Leu Thr Pro Glut Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg Tyr 

20 25 30 

Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu Tyr 

35 40 45 

Ala Trp Asp Asp Gly Tyr Gin lie Val Tyr Lys Leu Glu Met Arg Lys 

50 55 60 

Lys Glu Glu Glu Val 
65 

<210> 93 
<211> 202 
<212> PRT 

<213> Mus musculus 



4uu> y.* 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 

15 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 €0 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser 



60 
120 



360 
420 
480 
540 



720 
728 
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130 










135 




Met 


Lys 


He 


Leu 


Lys 


Arg 


Phe Gly Gly 


Ser 


145 








150 






Asp 


Pro 


Leu 


Gly 


Pro 


Ala Glu Lys He 


Tyr 








165 






170 


Asn 


Asp 


Thr 


Ala 


Phe 


Val 


Phe Pro Arg 


Leu 






180 






185 




Met 


Ala 


Ala 
195 


Arg 


Lys 


Ala 


Ser Arg He 
200 


Arg 



140 

Val Gly Leu Trp Thr Lys 
155 160 
Val Leu Asp Gly Thr Gin 
175 

Arg Asp Phe Thr Leu Ala 
190 



<210> 94 
<211> 69 
<212> PRT 

<213> Mus musculus 



Arg Pro Ala Ser Arg Ala Arg lie Gin Cys Ser Phe Asp Ala Ser Gly 

1 5 10 ^ ns 

Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg Tyr 

20 25 30 

Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu Tyr 

35 40 45 

Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Lys Lys 

50 55 60 

Lys Glu Glu Glu Val 
65 

<210> 95 
<211> 19 
<212> PRT 

<213> Mus musculus 
<400> 95 



< vJl > Pro Phe Pro Trp Val Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe 
15 10 15 

Leu Tyr Tyr 



<210> 96 
<211> 16 
<212> PRT 

<213> Mus musculus 

^l^Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn Thr 
15 10 15 

<210> 97 
<211> 99 
<212> PRT 

<213> Mus musculus 

< Ala > A^g Arg Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn 
1 5 10 15 

Thr Leu Gin Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp 

20 25 30 

Ser Ser Val Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr 
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35 40 
Ala Asp Thr Tyr He Asp Leu Ala 

50 55 
Val Tyr Ala Thr Arg Asp Asp Asp 
65 70 
Asp Pro Gin Thr Leu Asp Thr Glu 
85 

Arg Glu Asn 



45 

Ala Asp Glu Glu Gly Leu Trp Ala 
60 

Arg His Leu Cys Leu Ala Lys Leu 

75 80 
Gin Gin Trp Asp Thr Pro Cys Pro 
90 95 



<210> 98 

<211> 320 

<212> PRT 

<213> Mus musculus 

< 400> 98 *u 
Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 

15 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arg Leu Ala Ala Levi Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 

65 70 75 80 

Asp Ser He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 

145 150 155 160 

Asp Pro Leu Gly Pro Ala Glu Lys He Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg He Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 

210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin 

225 * 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 255 

Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 

305 310 315 320 

<210> 99 
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<211> 299 
<212> PRT 
<213> Mus musculus 

<400> 99 

Gin Gin His His Leu Val Glu Tyr Met Glu Arg Arg Leu Ala Ala Leu 

15 10 15 

Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser Ser Arg His Ala Ala 

20 25 30 

Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro Leu Leu Glu Val Ala 

35 40 45 

Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala Asp Ser He Ser Gly 

50 " 55 60 

Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr Leu Glu Thr Gin Asn 
65 70 75 80 

Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys Val Thr Gly Gly Pro 

85 90 95 

Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr 

100 105 HO 

Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser Met Lys He Leu Lys 

115 120 125 

Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys Asp Pro Leu Gly Pro 

130 * 135 140 

Ala Glu Lys He Tyr Val Leu Asp Gly Thr Gin Asn Asp Thr Ala Phe 
145 ' 150 155 160 

Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala Met Ala Ala Arg Lys 

165 170 175 

Ala Ser Arg He Arg Val Pro Phe Pro Trp Val Gly Thr Gly Gin Leu 

180 185 190 

Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg Pro Pro Gly Gly Pro 

195 200 205 

Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin Leu He Lys Phe His 

210 215 220 

Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val Phe Pro Ala Glu Ser 
225 230 235 240 

Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr Tyr He Asp Leu Ala 

245 250 255 

Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala Thr Arg Asp Asp Asp 

260 265 270 

Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin Thr Leu Asp Thr Glu 

275 280 285 

Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
290 295 

<210> 100 
<211> 728 
<212> DNA 
<213> Homo sapiens 

<400> 100 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 
gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 
catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 
gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 
ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgtcggggag tgctcggtgc 
ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 
cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 
gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 



60 
120 
180 
240 
300 
360 
420 
480 
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gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540 

ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 

ctgaggacca agtgtgggto, caggtgggtg tgggtgacta cattggcatc tatgccagca 660 

tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 



tctttgct 728 

<210> 101 
<211> 728 
<212> DNA 

<213> Homo sapiens 
<400> 101 

atgaggccac tcctcgtcct gctgctcctg ggcctggcgg ccggctcgcc cccactggac 60 

gacaacaaga tccccagcct ctgcccgggg caccccggcc ttccaggcac gccgggccac 120 

catggcagcc agggcttgcc gggccgcgat ggccgcgacg gccgcgacgg cgcgcccggg 180 

gctccgggag agaaaggcga gggcgggagg cgggactgcc gggacctcga ggggaccccg 240 

ggccgcgagg agaggcggga cccgcggggc ccaccgggcc tgccggggag tgctcggtgc 3 00 

ctccgcgatc cgccttcagc gccaagcgct ccgagagccg ggtgcctccg ccgtctgacg 360 

cacccttgcc cttcgaccgc gtgctggtga acgagcaggg acattacgac gccgtcaccg 420 

gcaagttcac ctgccaggtg cctggggtct actacttcgc cgtccatgcc accgtctacc 480 

gggccagcct gcagtttgat ctggtgaaga atggcgaatc cattgcctct ttcttccagt 540 

ttttcggggg gtggcccaag ccagcctcgc tctcgggggg ggccatggtg aggctggagc 600 

ctgaggacca agtgtgggtg caggtgggtg tgggtgacta cattggcatc tatgccagca 660 

tcaagacaga cagcaccttc tccggatttc tggtgtactc cgactggcac agctccccag 720 



tctttgct 

<210> 102 
<211> 243 
<212> PRT 
<213> Homo sapiens 

<400> 102 



20 

Gly Leu Pro Gly 
35 

Arg Asp Gly Arg 
50 

Lys Gly Glu Gly 
65 

Gly Pro Arg Gly 

Glu Cys Ser Val 
100 

Ser Arg Val Pro 
115 

Leu Val Asn Glu 
130 

Cys Gin Val Pro 
145 



Leu Val 


Leu Leu Leu Leu Gly Leu Ala 


Ala 


Gly 


Ser 


5 


10 




15 




Asp Asn 


Lys lie Pro Ser Leu Cys Pro 


Gly 


His 


Pro 


25 


30 






Thr Pro 


Gly His His Gly Ser Gin Gly 


Leu 


Pro 


Gly 




40 45 








Asp Gly 


Arg Asp Gly Val Pro Gly Ala 


Pro 


Gly 


Glu 




55 60 








Gly Arg 


Pro Gly Leu Pro Gly Pro Arg 


Gly 


Asp 


Pro 


70 


75 






80 


Glu Ala 


Gly Pro Ala Gly Pro Thr Gly 


Pro 


Ala 


Gly 


85 


90 




95 




Pro Pro 


Arg Ser Ala Phe Ser Ala Lys 


Arg 


Ser 


Glu 




105 


110 






Pro Pro 


Ser Asp Ala Pro Leu Pro Phe 


Asp 


Arg 


Val 




120 125 








Gin Gly 


His Tyr Asp Ala Val Thr Gly 


Lys 


Phe 


Thr 




135 140 








Gly Val 


Tyr Tyr Phe Ala Val His Ala 


Thr 


Val 


Tyr 


150 


155 






160 


Gin Phe 


Asp Leu Val Lys Asn Gly Glu 


Ser 


He 


Ala 


165 


170 




175 




Phe Phe 


Gly Gly Trp Pro Lys Pro Ala 


Ser 


Leu 


Ser 




185 


190 






Val Arg 


Leu Glu Pro Glu Asp Gin Val 


Trp 


Val 


Gin 



180 

Ala Met 

195 200 205 

61 
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Val Gly Val Gly Asp Tyr lie Gly lie Tyr Ala Ser He Lys Thr Asp 

210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 



<210> 103 

<211> 1338 

<212> DNA 

<213> Homo sapiens 



<400> 103 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 
agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 
gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 
cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 
actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 
gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggtgc 
gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 
gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 
tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 
tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 
gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 
gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 
ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 
ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 
gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 
tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 
gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 
gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 
gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 
ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 
ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 
gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 
aaaaaaaagg gcggccgc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1338 



<210> 104 
<211> 243 
< 2 1 2 > PRT 

<213> Homo sapiens 



<400> 104 








Met Arg 


Pro 


Leu Leu Val 
5 


Leu Leu 


1 

Pro Pro 


Leu 


Asp Asp Asn 


Lys He 






20 




Gly Leu 


Pro 


Gly Thr Pro 


Gly His 


35 




40 


Arg Asp 


Gly 


Arg Asp Gly 


Arg Asp 


50 






55 


Lys Gly 


Glu 


Gly Gly Arg 


Pro Gly 


65 




70 




Gly Pro 


Arg 


Gly Glu Ala 


Gly Pro 






85 




Glu Cys 


Ser 


Val Pro Pro 


Arg Ser 






100 




Ser Arg 


Val 


Pro Pro Pro 


Ser Asp 



Leu 


Leu Gly Leu Ala 


Ala 


Gly 


Ser 




10 




15 




Pro 


Ser Leu Cys Pro 


Gly 


His 


Pro 


25 




30 






His 


Gly Ser Gin Gly 


Leu 


Pro 


Gly 




45 








Gly 


Ala Pro Gly Ala 


Pro 


Gly 


Glu 




60 








Leu 


Pro Gly Pro Arg 


Gly 


Asp 


Pro 




75 






80 


Ala 


Gly Pro Thr Gly 


Pro 


Val 


Gly 




90 




95 




Ala 


Phe Ser Ala Lys 


Arg 


Ser 


Glu 


105 




110 




Val 


Ala 


Pro Leu Pro Phe 


Asp 


Arg 
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X15 120 125 

Leu val Asn Glu Gin Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr 

130 135 140 

Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 

145 I 50 155 , It 

Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Glu Ser lie Ala 

165 l -70 175 

Ser Phe Phe Gin Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 

180 "5 I 90 

Glv Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val Gin 

* 195 200 205 

val Gly Val Gly Asp Tyr lie Gly lie Tyr Ala Ser lie Lys Thr Asp 

210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 



<210> 105 
<211> 1338 
<212> DNA 

<213> Homo sapiens 
<400> 105 



400> 10b 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 
agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 
gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 
cagcgc?a?g aggcSctcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 
acLLcgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 
gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 
gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 
|acc?cSgc cgc^ggaga ggcgggaccc gcggggccca ccgggcctgt cggggagtgc 
tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt ^ctccgccg 
tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca "acgacgcc 
gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ^catgccacc 
Itctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 
ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc "tggtgagg 
ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 
gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 
tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact "tagaagga 
gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 
gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 
gctgtScl atcStctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 
ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 
SStSctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 
gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 
aaaaaaaagg gcggccgc 

<210> 106 
<211> 243 
<212> PRT 

<213> Homo sapiens 

< Met > A^g 6 pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser 
1 5 10 15 

Pro Pro Leu Asp Asp Asn Lys lie Pro Ser Leu Cys Pro Gly His Pro 
20 25 30 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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900 

960 
1020 
1080 
1140 
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1260 
1320 
1338 
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Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro Gly 

35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 

50 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala Gly 

85 90 95 

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu 

100 105 HO 

Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg Val 

H5 120 125 

Leu Ala Asn Glu Gin Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr 

130 135 140 

Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 150 155 160 

Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Glu Ser lie Ala 

165 170 175 

Ser Phe Phe Gin Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 

180 185 190 

Glv Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val Gin 

195 200 205 

Val Gly Val Gly Asp Tyr He Gly lie Tyr Ala Ser He Lys Thr Asp 

210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 



<210> 107 
<211> 1338 
<212> DNA 

<213> Homo sapiens 



<400> 107 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 
agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 
gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 
cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 
actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 3 00 
gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 
gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 
gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 
tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 
tctgacgcac ccttgccctt cgaccgcgtg ctggcgaacg agcagggaca ttacgacgcc 
gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 
gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccat tgcctctttc 
ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 
ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 
gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 
tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 
gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 
gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 
gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 
qqagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 
ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 
gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 
aaaaaaaagg gcggccgc 



60 
120 
180 
240 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1338 
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210> 108 
211> 243 
212> PRT 

213> Homo sapiens 
400> 108 

Met Arg Pro Leu Leu Val Leu Leu Leu Leu Gly Leu Ala Ala Gly Ser 

1 ~ 5 10 15 

Pro Pro Leu Asp Asp Asn Lys He Pro Ser Leu Cys Pro Gly His Pro 

20 25 30 

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro Gly 

35 J 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 

50 " 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Asp Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Ala Gly Pro Thr Gly Pro Ala Gly 

85 90 95 

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Ala Lys Arg Ser Glu 

100 105 HO 

Ser Arg Val Pro Pro Pro Ser Asp Ala Pro Leu Pro Phe Asp Arg Val 

115 120 125 

Leu Val Asn Glu Gin Gly His Tyr Asp Ala Val Thr Gly Lys Phe Thr 

130 135 140 

Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 150 155 160 

Arg Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Glu Ser Leu Ala 

165 170 175 

Ser Phe Phe Gin Phe Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 

180 185 190 

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val Gin 

195 200 205 

Val Gly Val Gly Asp Tyr He Gly He Tyr Ala Ser He Lys Thr Asp 

210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 



<210> 109 
<211> 1338 
<212> DNA 

<213> Homo sapiens 
<400> 109 

gtcgacccac gcgtccggga ctggggtgac ggcagggcag ggggcgcctg gccggggaga 
agcgcggggg ctggagcacc accaactgga gggtccggag tagcgagcgc cccgaaggag 
gccatcgggg agccgggagg ggggactgcg agaggacccc ggcgtccggg ctcccggtgc 180 
cagcgctatg aggccactcc tcgtcctgct gctcctgggc ctggcggccg gctcgccccc 24 0 

actggacgac aacaagatcc ccagcctctg cccggggcac cccggccttc caggcacgcc 
gggccaccat ggcagccagg gcttgccggg ccgcgatggc cgcgacggcc gcgacggcgc 
gcccggggct ccgggagaga aaggcgaggg cgggaggcgg gactgccggg acctcgaggg 
gaccccgggc cgcgaggaga ggcgggaccc gcggggccca ccgggcctgc cggggagtgc 
tcggtgcctc cgcgatccgc cttcagcgcc aagcgctccg agagccgggt gcctccgccg 540 
tctgacgcac ccttgccctt cgaccgcgtg ctggtgaacg agcagggaca ttacgacgcc 
gtcaccggca agttcacctg ccaggtgcct ggggtctact acttcgccgt ccatgccacc 
gtctaccggg ccagcctgca gtttgatctg gtgaagaatg gcgaatccct tgcctctttc 
ttccagtttt tcggggggtg gcccaagcca gcctcgctct cggggggggc catggtgagg 



60 
120 



300 
360 
420 
480 



600 
660 
720 
780 
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ctggagcctg aggaccaagt gtgggtgcag gtgggtgtgg gtgactacat tggcatctat 840 

gccagcatca agacagacag caccttctcc ggatttctgg tgtactccga ctggcacagc 900 

tccccagtct ttgcttagtg cccactgcaa agtgagctca tgctctcact cctagaagga 960 

gggtgtgagg ctgacaacct ggtcatccag gagggctggc ccccctggaa tattgtgaat 1020 

gactagggag gtggggtaga gcactctccg tcctgctgct ggcaaggaat gggaacagtg 1080 

gctgtctgcg atcaggtctg gcagcatggg gcagtggctg gatttctgcc caagaccaga 1140 

ggagtgtgct gtgctggcaa gtgtaagtcc cccagttgct ctggtccagg agcccacggt 1200 

ggggtgctct cttcctggtc ctctgcttct ctggatcctc cccaccccct cctgctcctg 1260 

gggccggccc ttttctcaga gatcactcaa taaacctaag aaccctccaa aaaaaaaaaa 1320 

aaaaaaaagg gcggccgc 1338 

<210> 110 
<211> 406 
<212> PRT 
<213> Homo sapiens 

<400> 110 

Met Gly Pro Ser Thr Pro Leu Leu lie Leu Phe Leu Leu Ser Trp Ser 

15 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr lie Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Ala Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg 

210 215 220 

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin 
225 " 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 255 

Phe Pro Ala Glu Gly Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

275 280 285 

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 
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325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 

340 345 350 

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg 

355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 

370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Arg 
385 390 395 400 

Lys Lys Glu Glu Glu Val 
4 05 

<210> 111 

<211> 1831 

<212> DNA 

<213> Homo sapiens 

<400> 111 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 

tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 

cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 300 

gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 3 60 

cgggaggcag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 

aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatatg 4 80 

gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 

ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 

ttagatggga cacagaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctt 660 

gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 7 80 

ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 84 0 

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 102 0 

tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 

agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 
tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa agggcggccg c 

<210> 112 
<211> 406 
<212> PRT 

<213> Homo sapiens 
<400> 112 

Met Gly Pro Ser Thr Pro Leu Leu He Leu Phe Leu Leu Ser Trp Ser 
15 10 15 



1800 
1831 
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Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp He Val Thr Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg 

210 215 220 

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin 
225 ~ 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 255 

Phe Pro Ala Glu Gly Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

275 280 285 

Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 " 310 315 320 

Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 

325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 

340 345 350 

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg 

355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 

370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Arg 
385 ~ 390 395 400 

Lys Lys Glu Glu Glu Val 
405 

<210> 113 

<211> 1831 

<212> DNA 

<213> Homo sapiens 

<400> 113 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 
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tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 

cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 300 

gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360 

cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 420 

aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatata 480 

gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 540 

ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 

ttagatggga cacagaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctt 660 

gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 720 

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 

ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 84 0 

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 102 0 

tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 

agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 

catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac i««n 
gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 
tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa agggcggccg c 

<210> 114 
<211> 406 
<212> PRT 
<213> Homo sapiens 

<400> 114 

Met Gly Pro Ser Thr Pro Leu Leu He Leu Phe Leu Leu Ser Trp Ser 

1 5 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Thr He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Phe Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Thr Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Gly Tyr Thr He Ser Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Pro Ala Gly He Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Gin Thr Glu Lys He Tyr Val Leu Asp Gly Thr Gin 



1680 
1740 
1800 
1831 
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170 l" 75 



Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 



180 185 



Met Ala Ala Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg 

210 21 s 220 

Pro Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin 
230 235 * HU 

III lie Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 
Phe Pro Ala Glu Gly Leu lie Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 2 

Tyr lie Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

280 285 
Thr Arg III Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 , » 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 

III Glu Ala Ala Phe III He Cys Gly Thr III Tyr Val Val Tyr Asn 

325 330 
Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 

340 345 350 

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg 

355 360 36S 

Tvr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 

375 

Tyr Ma Trp Asp Asp Gly Tyr Gin lie Val Tyr Lys Leu Glu Met Arg 

385 390 39 

Lys Lys Glu Glu Glu Val 
405 

<210> 115 

<211> 1831 

<212> DNA 

<213> Homo sapiens 

'gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 

tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 

cttttg?cat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 

cgccgactag SgcSLg. ggaacggctg gcccagtgcc aggaccagag tagtcggcat 240 

g?tgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt OTcagagaag 300 

gagcgggagg clctcagaac tgaggccgac accatctocg ggagagtgga tcgtctggag 360 

cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga 9"tgatgag 420 

aStgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa ^acgatatg JJJ 

qjgacagacr gtggctacac aatctctcaa gtgagatcaa tgaagattct 9^gcgattt 540 

Sggcccag ctggtatatg gaccaaggat ccactggggc aaacagagaa aatctacgtg 600 

ttagftggga cac!gaatga cacagccttt gtcttcccaa ggctgcgtga cttcaccctt 660 

gccltggctg cccg^aaagc ttcccgagtc cgggtgccct tcccctgggt *ggcacaggg 720 

?agctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 

ogrggrgaga tggagaLac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 840 

gSacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 

accracatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 

caroacaqqc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 1020 
Sfacacac catgtcccag Igagaatgct gaggctgcct ttgtcatctg tggg-ccctc 

tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ^ttgatgcc 11 
agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 
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catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 

cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 

gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 

cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 

aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 

agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 

atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 

gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 

gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 1740 

tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 



<210> 116 

<211> 406 

<212> PRT 

<213> Homo sapiens 



<400> 116 



Met 


Gly 


Pro 


Ser 


Thr 


Pro 


Leu 


Leu 


He 


Leu 


Phe 


Leu 


Leu 


Ser 


Trp 


Ser 


1 








5 










10 










15 




Gly 


Pro 


Leu 


Gin 


Gly 


Gin 


Gin 


His 


His 


Leu 


Val 


Glu 


Tyr 


Met 


Glu 


Arg 








20 










25 










30 






Arg 


Leu 


Ala 


Ala 


Leu 


Glu 


Glu 


Arg 


Leu 


Ala 


Gin 


Cys 


Gin 


Asp 


Gin 


Ser 






35 










40 










45 








Ser 


Arg 


His 


Ala 


Ala 


Glu 


Leu 


Arg 


Asp 


Phe 


Lys 


Asn 


Lys 


Met 


Leu 


Pro 




50 










55 










60 










Leu 


Leu 


Glu 


Val 


Ala 


Glu 


Lys 


Glu 


Arg 


Glu 


Ala 


Leu 


Arg 


Thr 


Glu 


Ala 


65 










70 










75 










80 


Asp 


Thr 


He 


Ser 


Gly 


Arg 


Val 


Asp 


Arg 


Leu 


Glu 


Arg 


Glu 


Val 


Asp 


Tyr 










85 










90 










95 




Leu 


Glu 


Thr 


Gin 


Asn 


Pro 


Ala 


Leu 


Pro 


Cys 


Val 


Glu 


Phe 


Asp 


Glu 


Lys 








100 










105 










110 






Val 


Thr 


Gly 


Gly 


Pro 


Gly 


Thr 


Lys 


Gly 


Lys 


Gly 


Arg 


Arg 


Asn 


Glu 


Lys 






115 










120 










125 








Tyr 


Asp 


Met 


Val 


Thr 


Asp 


Cys 


Gly 


Tyr 


Thr 


He 


Ser 


Gin 


Val 


Arg 


Ser 




130 










135 










140 










Met 


Lys 


He 


Leu 


Lys 


Arg 


Phe 


Gly 


Gly 


Pro 


Ala 


Gly 


Leu 


Trp 


Thr 


Lys 


145 










150 










155 










160 


Asp 


Pro 


Leu 


Gly 


Gin 


Thr 


Glu 


Lys 


He 


Tyr 


Val 


Leu 


Asp 


Gly 


Thr 


Gin 










165 










170 










175 




Asn 


Asp 


Thr 


Val 


Phe 


Val 


Phe 


Pro 


Arg 


Leu 


Arg 


Asp 


Phe 


Thr 


Leu 


Ala 








180 










185 










190 






Met 


Ala 


Ala 


Arg 


Lys 


Ala 


Ser 


Arg 


Val 


Arg 


Val 


Pro 


Phe 


Pro 


Trp 


Val 






195 










200 










205 








Gly 


Thr 


Gly 


Gin 


Leu 


Val 


Tyr 


Gly 


Gly 


Phe 


Leu 


Tyr 


Phe 


Ala 


Arg 


Arg 




210 










215 










220 










Pro 


Pro 


Gly 


Arg 


Pro 


Gly 


Gly 


Gly 


Gly 


Glu 


Met 


Glu 


Asn 


Thr 


Leu 


Gin 


225 










230 










235 










240 


Leu 


He 


Lys 


Phe 


His 


Leu 


Ala 


Asn 


Arg 


Thr 


Val 


Val 


Asp 


Ser 


Ser 


Val 










245 










250 










255 




Phe 


Pro 


Ala 


Glu 


Gly 


Leu 


He 


Pro 


Pro 


Tyr 


Gly 


Leu 


Thr 


Ala 


Asp 


Thr 








260 










265 










270 






Tyr 


He 


Asp 


Leu 


Ala 


Ala 


Asp 


Glu 


Glu 


Gly 


Leu 


Trp 


Ala 


Val 


Tyr 


Ala 






275 










280 










285 








Thr 


Arg 


Glu 


Asp 


Asp 


Arg 


His 


Leu 


Cys 


Leu 


Ala 


Lys 


Leu 


Asp 


Pro 


Gin 




290 










295 










300 










Thr 


Leu 


Asp 


Thr 


Glu 


Gin 


Gin 


Trp 


Asp 


Thr 


Pro 


Cys 


Pro 


Arg 


Glu 


Asn 


305 










310 










315 










320 
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Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 

325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 

340 345 350 

Gly Thr Leu Thr Pro Glu Arg Ala Ala Leu Pro Tyr Phe Pro Arg Arg 

355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 

370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Arg 
385 390 395 400 

Lys Lys Glu Glu Glu Val 
405 

<210> 117 
<211> 1831 
<212> DNA 

<213> Homo sapiens 
<400> 117 

gtcgacccac gcgtccgcgg acgcgtgggt gaggggaaga ggctgactgt acgttccttc 60 
tactctggca ccactctcca ggctgccatg gggcccagca cccctctcct catcttgttc 120 
cttttgtcat ggtcgggacc cctccaagga cagcagcacc accttgtgga gtacatggaa 180 
cgccgactag ctgctttaga ggaacggctg gcccagtgcc aggaccagag tagtcggcat 24 0 

gctgctgagc tgcgggactt caagaacaag atgctgccac tgctggaggt ggcagagaag 3 00 

gagcgggagg cactcagaac tgaggccgac accatctccg ggagagtgga tcgtctggag 360 
cgggaggtag actatctgga gacccagaac ccagctctgc cctgtgtaga gtttgatgag 42 0 
aaggtgactg gaggccctgg gaccaaaggc aagggaagaa ggaatgagaa gtacgatatg 4 80 
gtgacagact gtggctacac aatctctcaa gtgagatcaa tgaagattct gaagcgattt 54 0 

ggtggcccag ctggtctatg gaccaaggat ccactggggc aaacagagaa gatctacgtg 600 
ttagatggga cacagaatga cacagtcttt gtcttcccaa ggctgcgtga cttcaccctt 660 
gccatggctg cccggaaagc ttcccgagtc cgggtgccct tcccctgggt aggcacaggg 72 0 

cagctggtat atggtggctt tctttatttt gctcggaggc ctcctggaag acctggtgga 780 
ggtggtgaga tggagaacac tttgcagcta atcaaattcc acctggcaaa ccgaacagtg 84 0 

gtggacagct cagtattccc agcagagggg ctgatccccc cctacggctt gacagcagac 900 
acctacatcg acctggcagc tgatgaggaa ggtctttggg ctgtctatgc cacccgggag 960 
gatgacaggc acttgtgtct ggccaagtta gatccacaga cactggacac agagcagcag 102 0 
tgggacacac catgtcccag agagaatgct gaggctgcct ttgtcatctg tgggaccctc 1080 
tatgtcgtct ataacacccg tcctgccagt cgggcccgca tccagtgctc ctttgatgcc 1140 
agcggcaccc tgacccctga acgggcagca ctcccttatt ttccccgcag atatggtgcc 1200 
catgccagcc tccgctataa cccccgagaa cgccagctct atgcctggga tgatggctac 1260 
cagattgtct ataagctgga gatgaggaag aaagaggagg aggtttgagg agctagcctt 1320 
gttttttgca tctttctcac tcccatacat ttatattata tccccactaa atttcttgtt 1380 
cctcattctt caaatgtggg ccagttgtgg ctcaaatcct ctatattttt agccaatggc 1440 
aatcaaattc tttcagctcc tttgtttcat acggaactcc agatcctgag taatcctttt 1500 
agagcccgaa gagtcaaaac cctcaatgtt ccctcctgct ctcctgcccc atgtcaacaa 1560 
atttcaggct aaggatgccc cagacccagg gctctaacct tgtatgcggg caggcccagg 1620 
gagcaggcag cagtgttctt cccctcagag tgacttgggg agggagaaat aggaggagac 1680 
gtccagctct gtcctctctt cctcactcct cccttcagtg tcctgaggaa caggactttc 174 0 
tccacattgt tttgtattgc aacattttgc attaaaagga aaatccactg ctaaaaaaaa 1800 
aaaaaaaaaa aaaaaaaaaa agggcggccg c 1831 

<210> 118 
<211> 242 
<212> PRT 

<213> Mus musculus 
<400> 118 

Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly Ser 
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1 5 



Pro Pro Leu Asp Asp 


Asn 


Lys 


He 


20 








Gly Leu Pro Gly Thr 


Pro 


Gly 


His 


35 






40 


Arg Asp Gly Arg Asp 


Gly 


Arg 


Asp 


50 




55 




Lys Gly Glu Gly Gly 


Arg 


Pro 


Gly 


65 


70 






Gly Pro Arg Gly Glu 


Gly 


Pro 


Met 


85 








Cys Ser Val Pro Pro 


Arg 


Ser 


Ala 


100 








Arg Val Pro Pro Pro 


Ala 


Asp 


Thr 


115 






120 


Leu Asn Glu Gin Gly 


His 


Tyr 


Asp 


130 




135 




Gin Val Pro Gly Val 


Tyr 


Tyr 


Phe 


14 5 


150 






Ala Ser Leu Gin Phe 


Asp 


Leu 


val 


165 








Phe Phe Gin Tyr Phe 


Gly 


Gly 


Trp 


180 








Gly Ala Met Val Arg 


Leu 


Glu 


Pro 


195 






200 


Gly Val Gly Asp Tyr 


He 


Gly 


He 


210 




215 




Thr Phe Ser Gly Phe 


Leu 


Val 


Tyr 


225 


230 







Phe Ala 





10 




15 




Pro 


Ser 


Leu Cys Pro Gly Gin 


Pro 


25 






30 




His 


Gly 


Ser Gin Gly 


Leu Pro 


Gly 






45 






Gly 


Ala 


Pro Gly Ala Pro Gly Glu 






60 






Leu 


Pro 


Gly Pro Arg Gly Glu 


Pro 






75 




80 


Gly 


Ala 


He Gly Pro Ala Gly Glu 




90 




95 




Phe 


Ser 


Ala Lys Arg 


Ser Glu 


Ser 


105 






110 




Pro 


Leu 


Pro Phe Asp 


Arg Val 


Leu 






125 






Pro 


Thr 


Thr Gly Lys 


Phe Thr 


Cys 






140 






Ala 


Val 


His Ala Thr 


Val Tyr 


Arg 






155 




160 


Lys 


Asn 


Gly Gin Ser 


He Ala 


Ser 


170 




175 




Pro 


Lys 


Pro Ala Ser 


Leu Ser Gly 


185 






190 




Glu 


Asp 


Gin Val Trp 


Val Gin 


Val 






205 






Tyr 


Ala 


Ser He Lys Thr Asp 


Ser 






220 






Ser 


Asp 


Trp His Ser 


Ser Pro 


Val 




235 




240 



<210> 119 
<211> 1263 
<212> DNA 

<213> Mus musculus 
<400> 119 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 120 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 300 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 360 

cacgtgggga gcccgggccg cgtggagagg tagggcccat gggggctatc gggcctgcgg 420 

gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag agccgggtac 480 

ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggccatt 540 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 

acgccactgt ctaccgggcc agcttgcagt ttgatcttgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 
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ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 

1263 

cgc 

210> 120 
211> 243 
212> PRT 

213> Mus musculus 

400> 120 ^ 
Met Arg Pro Leu Leu Ala Leu Leu Leu Leu Gly Leu Val Ser Gly Ser 

15 10 15 

Pro Pro Leu Asp Asp Asn Lys He Pro Ser Leu Cys Pro Gly Gin Pro 

20 25 30 

Gly Leu Pro Gly Thr Pro Gly His His Gly Ser Gin Gly Leu Pro Gly 

35 40 45 

Arg Asp Gly Arg Asp Gly Arg Asp Gly Ala Pro Gly Ala Pro Gly Glu 

50 55 60 

Lys Gly Glu Gly Gly Arg Pro Gly Leu Pro Gly Pro Arg Gly Glu Pro 
65 70 75 80 

Gly Pro Arg Gly Glu Ala Gly Pro Met Gly Ala He Gly Pro Ala Gly 

85 90 95 

Glu Cys Ser Val Pro Pro Arg Ser Ala Phe Ser Val Lys Arg Ser Glu 

100 105 HO 

Ser Arg Val Pro Pro Pro Ala Asp Thr Pro Leu Pro Phe Asp Arg Val 

115 120 125 

Leu Leu Asn Glu Gin Gly His Tyr Asp Pro Thr Thr Gly Lys Phe Thr 

130 135 140 

Cys Gin Val Pro Gly Val Tyr Tyr Phe Ala Val His Ala Thr Val Tyr 
145 150 155 160 

Arq Ala Ser Leu Gin Phe Asp Leu Val Lys Asn Gly Gin Ser He Ala 

165 170 175 

Ser Phe Phe Gin Tyr Phe Gly Gly Trp Pro Lys Pro Ala Ser Leu Ser 

180 185 190 

Gly Gly Ala Met Val Arg Leu Glu Pro Glu Asp Gin Val Trp Val Gin 

195 200 205 

Val Gly Val Gly Asp Tyr He Gly He Tyr Ala Ser He Lys Thr Asp 

210 215 220 

Ser Thr Phe Ser Gly Phe Leu Val Tyr Ser Asp Trp His Ser Ser Pro 
225 230 235 240 

Val Phe Ala 



<210> 121 

<211> 1263 

<212> DNA 

<213> Mus musculus 

<400> 121 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 
cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 
ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 
ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 
gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 
acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 
cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc gggcctgcgg 
gggagtgctc ggtaccccca cgatcagtct tcagtgccaa gcgatccgag agccgggtac 
ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggccatt 
acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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acgccactgt ctaccgggcc agcttgcagt ttgatcttgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 780 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 102 0 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 

cgc 1263 

<210> 122 
<211> 243 
<212> PRT 

<213> Mus musculus 



<400> 122 



Met 


Arg 


Pro 


Leu 


Leu 


Ala 


Leu Leu 


1 








5 






Pro 


Pro 


Leu 


Asp 


Asp 


Asn 


Lys He 








20 








Gly 


Leu 


Pro 


Gly 


Thr 


Pro Gly His 






35 








40 


Arg 


Asp 


Gly 


Arg 


Asp 


Gly Arg Asp 




50 










55 


Lys 


Gly 


Glu 


Gly 


Gly 


Arg 


Pro Gly 


65 










70 




Gly 


Pro 


Arg 


Gly 


Glu 


Ala Gly Pro 










85 






Glu 


Cys 


Ser 


Val 


Pro 


Pro Arg Ser 








100 








Ser 


Arg 


Val 


Pro 


Pro 


Pro Ala Asp 






115 








120 


Leu 


Leu 


Asn 


Glu 


Gin 


Gly His Tyr 




130 










135 


Cys 


Gin 


Val 


Pro 


Gly 


Val 


Tyr Tyr 


14 5 










150 




Arg 


Ala 


Ser 


Leu 


Gin 


Phe 


Asp Leu 










165 






Ser 


Phe 


Phe 


Gin 


Tyr 


Phe 


Gly Gly 








180 








Gly 


Gly 


Ala 


Met 


Val 


Arg 


Leu Glu 






195 








200 


Val 


Gly 


Val 


Gly 


Asp 


Tyr 


He Gly 




210 










215 


Ser 


Thr 


Phe 


Ser 


Gly 


Phe 


Leu Val 


225 










230 




Val 


Phe 


Ala 











Leu 


Leu 
10 


Gly 


Leu 


Val 


Ser 


Gly 
15 


Ser 


Pro 


Ser 


Leu 


Cys 


Pro 


Gly 


Gin 


Pro 


25 










30 






His 


Gly 


Ser 


Gin 


Gly 
45 


Leu 


Pro 


Gly 


Gly 


Ala 


Pro 


Gly 
60 


Ala 


Pro 


Gly 


Glu 


Leu 


Pro 


Gly 
75 


Pro 


Arg 


Gly 


Glu 


Pro 
80 


Met 


Gly 
90 


Ala 


He 


Gly 


Pro 


Ala 
95 


Gly 


Ala 


Phe 


Ser 


Ala 


Lys 


Arg 


Ser 


Glu 


105 










110 






Thr 


Pro 


Leu 


Pro 


Phe 
125 


Asp 


Arg 


Ala 


Asp 


Pro 


Thr 


Thr 
140 


Gly 


Lys 


Phe 


Thr 


Phe 


Ala 


Val 
155 


His 


Ala 


Thr 


Val 


Tyr 
160 


Val 


Lys 
170 


Asn 


Gly 


Gin 


Ser 


He 
175 


Ala 


Trp 


Pro 


Lys 


Pro 


Ala 


Ser 


Leu 


Ser 


185 










190 






Pro 


Glu 


Asp 


Gin 


Val 
205 


Trp 


Val 


Gin 


He 


Tyr 


Ala 


Ser 
220 


He 


Lys 


Thr 


Asp 


Tyr 


Ser 


Asp 
235 


Trp 


His 


Ser 


Ser 


Pro 
240 



<210> 123 

<211> 1263 

<212> DNA 

<213> Mus musculus 

<400> 123 
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gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct 

cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga 

cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc 

gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag 

ctccgccagc cgacacaccc ctacctttcg accgtgcgct gctaaatgag 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac 

acgccactgt ctaccgggcc agcttgcagt ttgatcttgt caaaaacggg 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa 
cgc 

<210> 124 
<211> 243 
<212> PRT 

<213> Mus musculus 



<400> 124 



Met 


Arg 


Pro 


Leu 


Leu 


Ala 


Leu 


Leu 


Leu 


Leu 


Gly 


Leu 


Val 


Ser 


Gly 


Ser 


1 








5 










10 










15 




Pro 


Pro 


Leu 


Asp 


Asp 


Asn 


Lys 


He 


Pro 


Ser 


Leu 


Cys 


Pro 


Gly 


Gin 


Pro 








20 










25 










30 






Gly 


Leu 


Pro 


Gly 


Thr 


Pro 


Gly 


His 


His 


Gly 


Ser 


Gin 


Gly 


Leu 


Pro 


Gly 






35 










40 










45 








Arg 


Asp 


Gly 


Arg 


Asp 


Gly 


Arg 


Asp 


Gly 


Ala 


Pro 


Gly 


Ala 


Pro 


Gly 


Glu 




50 










55 










60 










Lys 


Gly 


Glu 


Gly 


Gly 


Arg 


Pro 


Gly 


Leu 


Pro 


Gly 


Pro 


Arg 


Gly 


Glu 


Pro 


65 










70 










75 










80 


Gly 


Pro 


Arg 


Gly 


Glu 


Ala 


Gly 


Pro 


Met 


Gly 


Ala 


He 


Gly 


Pro 


Ala 


Gly 










85 










90 










95 




Glu 


Cys 


Ser 


Val 


Pro 


Pro 


Arg 


Ser 


Ala 


Phe 


Ser 


Ala 


Lys 


Arg 


Ser 


Glu 








100 










105 










110 






Ser 


Arg 


Val 


Pro 


Pro 


Pro 


Ala 


Asp 


Thr 


Pro 


Leu 


Pro 


Phe 


Asp 


Arg 


Val 






115 










120 










125 








Leu 


Leu 


Asn 


Glu 


Gin 


Gly 


His 


Tyr 


Asp 


Pro 


Thr 


Thr 


Gly 


Lys 


Phe 


Thr 




130 










135 










140 










Cys 


Gin 


Val 


Pro 


Gly 


Val 


Tyr 


Tyr 


Phe 


Ala 


Val 


His 


Ala 


Thr 


Val 


Tyr 


145 










150 










155 










160 


Arg 


Ala 


Ser 


Leu 


Gin 


Phe 


Asp 


He 


Val 


Lys 


Asn 


Gly 


Gin 


Ser 


He 


Ala 










165 










170 










175 




Ser 


Phe 


Phe 


Gin 


Tyr 


Phe 


Gly 


Gly 


Trp 


Pro 


Lys 


Pro 


Ala 


Ser 


Leu 


Ser 








180 










185 










190 






Gly 


Gly 


Ala 


Met 


Val 


Arg 


Leu 


Glu 


Pro 


Glu 


Asp 


Gin 


Val 


Trp 


Val 


Gin 






195 










200 










205 








Val 


Gly 


Val 


Gly 


Asp 


Tyr 


He 


Gly 


He 


Tyr 


Ala 


Ser 


He 


Lys 


Thr 


Asp 




210 










215 










220 










Ser 


Thr 


Phe 


Ser 


Gly 


Phe 


Leu 


Val 


Tyr 


Ser 


Asp 


Trp 


His 


Ser 


Ser 


Pro 
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aggagtcagt 60 

gtctgagcct 120 

gtgtcaggct 180 

ggccttccag 240 

gatggccgcg 3 00 

ctacctggcc 360 

gggcctgcgg 420 

agccgggtac 4 80 

cagggccatt 54 0 

tttgctgtgc 600 

cagtccatcg 660 

gggggtgcga 720 

gattacattg 780 

tattctgact 840 

cttgctcctc 900 

ccctggaata 960 

aatgaatgga 1020 

ccaggacttt 1080 

ggatgctcca 1140 

cagggcaggc 12 00 

aaagggcggc 1260 
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225 230 235 240 

Val Phe Ala 



<210> 125 

<211> 1263 

<212> DNA 

<213> Mus musculus 

<400> 125 

gtcgacccac gcgtccgcgc tgtgaagcca gcaaggagca accagaagct aggagtcagt 60 

cagcaaggac aggggctgcc tgcctacaga ctacaagaga ggttcctgga gtctgagcct 120 

ccggggtcac caccatgagg ccacttcttg cccttctgct tctgggtctg gtgtcaggct 180 

ctcctcctct ggacgacaac aagatcccca gcctgtgtcc cgggcagccc ggccttccag 240 

gcacaccagg tcaccatggc agccaaggcc tgcctggccg tgacggccgt gatggccgcg 3 00 

acggtgcacc cggagctccg ggagagaaag gcgagggcgg gagaccggga ctacctggcc 3 60 

cacgtgggga gcccgggccg cgtggagagg cagggcccat gggggctatc gggcctgcgg 420 

gggagtgctc ggtaccccca cgatcagcct tcagtgccaa gcgatccgag agccgggtac 4 80 

ctccgccagc cgacacaccc ctacctttcg accgtgtgct gctaaatgag cagggccatt 540 

acgaccccac tactggcaag ttcacctgcc aagtgcctgg cgtctactac tttgctgtgc 600 

acgccactgt ctaccgggcc agcttgcagt ttgatattgt caaaaacggg cagtccatcg 660 

cctctttctt ccagtatttt ggggggtggc ccaagccagc ctcgctctca gggggtgcga 720 

tggtaaggct agaacctgag gaccaggtgt gggtgcaggt gggcgtgggt gattacattg 7 80 

gcatctatgc cagcatcaag acagacagta ccttctctgg atttctcgtc tattctgact 840 

ggcacagctc cccagtcttc gcttaaaaca cagtgaaccc ggagctggca cttgctcctc 900 

agtggagggt gtgacactaa cccgcgcagc gcataccagg agggctggcc ccctggaata 960 

ttgtgaatga cttaggaaga gagggagcca cttccagtcc cactgctggc aatgaatgga 1020 

gacaggctgt ctgaggtcaa gacagcgtgg agcagtggct gggtttctgc ccaggacttt 1080 

agaatgcagt aggctggcag ctgtgggtcc tggcccagga ctccaaggtg ggatgctcca 1140 

ttcctagtcc tgtgtcccct ctaggtccct gactccatct ctgctgctcc cagggcaggc 1200 

ctttttctca gaggtcactt aataaaccta aaatcctcaa aaaaaaaaaa aaagggcggc 1260 
cgc 

<210> 126 

<211> 406 

<212> PRT 

<213> Mus musculus 

<400> 126 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 

15 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser He Ser Gly Arg Val Asp Arg He Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser 

130 135 140 

Met Lys lie Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 
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145 150 155 160 

Asp Pro Leu Gly Pro Ala Glu Lys He Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg He Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 

210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin 
225 230 235 240 

Leu lie Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 255 

Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr lie Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 

325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 

340 345 350 

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg 

355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 

370 375 380 

Tvr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Lys 
385 390 395 400 

Lys Lys Glu Glu Glu Val 
405 

<210> 127 

<211> 1721 

<212> DNA 

<213> Mus musculus 

<400> 127 _ 
gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 
ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 
gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 
catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 
aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtatt 
gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 
gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 
atggtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 
tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 
gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 
cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 
gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 
gggggtggtg aattggagaa cactctgcag ctgatcaaat >ttcacttggc aaaccgaaca 
gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 
gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 
gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 
cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 
ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
84 0 
900 
960 
1020 
1080 
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1620 
1680 
1721 



gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 

aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc i«n 
tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 
aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 

<210> 128 
<211> 406 
<212> PRT 
<213> Mus musculus 

<400> 128 m . 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 

15 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 

20 25 30 

Arq Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp He Val Thr Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Pro Ala Glu Lys He Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg He Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 

210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin 
225 ' 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 255 

Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 
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Ala Glu Ala Ala Phe Val He Cys 
325 

Thr Arg Pro Ala Ser Arg Ala Arg 
340 

Gly Thr Leu Ala Pro Glu Arg Ala 
355 360 
Tyr Gly Ala His Ala Ser Leu Arg 

370 375 
Tyr Ala Trp Asp Asp Gly Tyr Gin 
385 390 
Lys Lys Glu Glu Glu Val 
4 05 



Gly Thr Leu Tyr Val Val Tyr Asn 

330 335 
He Gin Cys Ser Phe Asp Ala Ser 
345 350 
Ala Leu Ser Tyr Phe Pro Arg Arg 
365 

Tyr Asn Pro Arg Glu Arg Gin Leu 
380 

He Val Tyr Lys Leu Glu Met Lys 
395 400 



<210> 129 

<211> 1721 

<212> DNA 

<213> Mus musculus 



<400> 129 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 

ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 

gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 

catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 2 40 

aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 3 00 

gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 3 60 

gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420 

atagtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 4 80 

tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540 

gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 

cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 

gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720 

gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 7 80 

gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 84 0 

gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 

gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 

cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020 

ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 

gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 

aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 162 0 

tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 

aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 130 
<211> 406 
<212> PRT 

<213> Mus musculus 
<400> 130 

Met Gly Pro Ser Ala Pro Leu Leu Leu Leu Phe Phe Leu Ser Trp Thr 

15 10 15 

Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met Glu Arg 
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20 25 30 

Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Pro Ala Glu Lys He Tyr Ala Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala 

180 185 190 

Met Ala Ala Arg Lys Ala Ser Arg He Arg Val Pro Phe Pro Trp Val 

195 200 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 

210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin 
225 ' 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 255 

Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 

325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 

340 345 350 

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg 

355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 

370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Lys 
385 390 395 400 

Lys Lys Glu Glu Glu Val 
405 

<210> 131 

<211> 1721 

<212> DNA 

<213> Mus musculus 

<400> 131 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 
ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 
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gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 

catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 240 

aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 300 

gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360 

gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420 

atggtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 4 80 

tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 540 

gcgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 

cttgccatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 

gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720 

gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780 

gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840 

gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 

gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 

cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020 

ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 

gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 

gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 

taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 

cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 

tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 

aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 1440 

aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 1500 

aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 1560 

ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 1620 

tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 1680 

aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 1721 

<210> 132 
<211> 406 
<212> PRT 

<213> Mus musculus 
<400> 132 



Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin Asp Gin Ser 

35 40 45 

Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn Lys Met Leu Pro 

50 55 60 

Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Thr Leu Arg Thr Glu Ala 
65 70 75 80 

Asp Ser He Ser Gly Arg Val Asp Arg Leu Glu Arg Glu Val Asp Tyr 

85 90 95 

Leu Glu Thr Gin Asn Pro Ala Leu Pro Cys Val Glu Leu Asp Glu Lys 

100 105 HO 

Val Thr Gly Gly Pro Gly Ala Lys Gly Lys Gly Arg Arg Asn Glu Lys 

115 120 125 

Tyr Asp Met Val Thr Asp Cys Ser Tyr Thr Val Ala Gin Val Arg Ser 

130 135 140 

Met Lys He Leu Lys Arg Phe Gly Gly Ser Val Gly Leu Trp Thr Lys 
145 150 155 160 

Asp Pro Leu Gly Pro Ala Glu Lys He Tyr Val Leu Asp Gly Thr Gin 

165 170 175 

Asn Asp Thr Ala Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Val 
180 185 190 
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Met Ala Ala Arg Lys Ala Ser Arg He Arg Val Pro Phe Pro Trp Val 

195 200 . 205 

Gly Thr Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Tyr Ala Arg Arg 

210 215 220 

Pro Pro Gly Gly Pro Gly Gly Gly Gly Glu Leu Glu Asn Thr Leu Gin 
225 230 235 240 

Leu He Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser Val 

245 250 255 

Phe Pro Ala Glu Ser Leu He Pro Pro Tyr Gly Leu Thr Ala Asp Thr 

260 265 270 

Tyr He Asp Leu Ala Ala Asp Glu Glu Gly Leu Trp Ala Val Tyr Ala 

275 280 285 

Thr Arg Asp Asp Asp Arg His Leu Cys Leu Ala Lys Leu Asp Pro Gin 

290 295 300 

Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro Cys Pro Arg Glu Asn 
305 310 315 320 

Ala Glu Ala Ala Phe Val He Cys Gly Thr Leu Tyr Val Val Tyr Asn 

325 330 335 

Thr Arg Pro Ala Ser Arg Ala Arg He Gin Cys Ser Phe Asp Ala Ser 

340 345 350 

Gly Thr Leu Ala Pro Glu Arg Ala Ala Leu Ser Tyr Phe Pro Arg Arg 

355 360 365 

Tyr Gly Ala His Ala Ser Leu Arg Tyr Asn Pro Arg Glu Arg Gin Leu 

370 375 380 

Tyr Ala Trp Asp Asp Gly Tyr Gin He Val Tyr Lys Leu Glu Met Lys 
385 * 390 395 400 

Lys Lys Glu Glu Glu Val 
4 05 

<210> 133 
<211> 1721 
<212> DNA 

<213> Mus musculus 
<400> 133 

gtcgacccac gcgtccgact taaggctgcc atggggccca gtgctcctct gctgctcctc 60 
ttctttttgt catggacggg accccttcag ggacagcagc accaccttgt ggagtacatg 120 
gaacgccgac tagctgcctt agaggaacgg ctggcccaat gccaggatca gagtagtcgg 180 
catgctgccg agcttcggga cttcaaaaac aagatgttgc ctctcctgga ggtggcagag 24 0 
aaggagcggg agaccctcag aactgaagca gactccatct caggaagagt ggaccgtctt 300 
gaaagggagg tagactatct ggagacacag aacccagctt tgccctgtgt agagctggat 360 
gagaaggtga ctggaggtcc tggagccaaa ggcaagggcc gaagaaatga gaaatacgat 420 
atggtgacgg actgtagcta cacagtcgct caggtgaggt caatgaagat cctgaagcgg 4 80 
tttggtggtt cagttggcct atggaccaag gatccgctgg ggccagcaga gaagatctac 54 0 
gtgttagacg gcacccagaa cgacacggct tttgtcttcc caaggctgcg tgacttcacc 600 
cttgtcatgg ctgcccggaa agcttcccga attcgggtgc ccttcccctg ggtaggcacg 660 
gggcagctgg tgtacggtgg cttcctttat tatgctcgaa ggcctcctgg aggacctgga 720 
gggggtggtg aattggagaa cactctgcag ctgatcaaat ttcacttggc aaaccgaaca 780 
gtggtggata gctcagtgtt ccctgcagag agcctgatac ccccctacgg cctgacagca 840 
gatacatata tcgacctggc agctgatgag gagggcctgt gggctgtcta tgccactcga 900 
gatgatgaca ggcatttgtg tctagccaag ttagacccac agacacttga cacagagcag 960 
cagtgggaca caccatgtcc cagagagaac gcagaggctg cgtttgtcat ctgtgggacc 1020 
ctgtacgttg tctataacac ccgccctgcc agtagggctc gtattcagtg ttccttcgat 1080 
gccagtggta ctctcgcccc tgaaagggca gcactctcct attttccacg ccgatatggt 1140 
gcccatgcca gccttcgcta taacccccgt gagcgccagc tgtatgcctg ggatgatggc 1200 
taccagattg tctacaaatt ggagatgaag aagaaggagg aggaagttta agcagctagc 1260 
cttgtgctct tgattcttat gcccagacat ttatattcct gtgagctctc ctgcagttca 1320 
tccttcaaaa cgaaggccag tggtggtagc tcatataccc taatttctaa aggacaacca 1380 
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aattctcaag cccctctgtt ttatgcagaa ctccagatcc tgggtagcat tttagaactg 
aacagcaaac aaacacccta aatcttcact cctgccttat gtccacaaag tttagttcca 
aactcagagc cctgtccttt ggagagggtc aaccccagac agcaggcgac agcattcttg 
ccctcagtat gaccgaaggg agagaactca gagacaaagc tgccctccct cccttccccc 
tccagtgtag gggagaatgg ggctttcccc acatcacttt gtatggtaac agtttgcatt 
aaaaggaaaa cccaccaaaa aaaaaaaaaa agggcggccg c 

<210> 134 
<211> 370 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 

<400> 134 ^ ^ 

Met He Ser Leu Pro Gly Pro Leu Val Thr Asn Leu Xaa Arg Phe Leu 

15 10 15 

Phe Leu Gly Leu Ser Ala Leu Ala Pro Pro Ser Arg Ala Gin Leu Gin 

20 25 30 

Leu His Leu Pro Ala Asn Arg Leu Gin Ala Val Glu Glu Gly Glu Ser 

35 40 45 

Gly Ala Ser Ala Trp Tyr Thr Leu His Arg Glu Ala Ser Ser Ser Gin 

50 55 60 

Pro Trp Glu Val Pro Phe Val Met Trp Phe Phe Lys Gin Lys Glu Lys 
65 70 75 80 

Glu Asp Gin Val Leu Ser Tyr lie Asn Gly Val Thr Thr Ser Lys Pro 

85 90 95 

Gly Val Ser Leu Val Tyr Ser Met Pro Ser Arg Asn Leu Ser Leu Arg 

100 105 HO 

Val Glu Gly Leu Gin Glu Lys Asp Ser Gly Pro Tyr Ser Cys Ser Val 

115 120 125 

Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser He Lys Thr 

130 135 140 

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu 
145 150 155 160 

Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gin Ser 

165 170 175 

Pro Arg Ser Lys Pro Ala Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro 

180 185 190 

Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He Arg Gly Ser 

195 200 205 

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys 

210 215 220 

Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val Thr Leu Glu 
225 230 235 240 

Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala Val Val Gly 

245 250 255 

Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu Leu Tyr His 

260 265 270 

Arg Arg Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp He Lys Glu Asp 

275 280 285 

Ala He Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr He 

290 295 300 

Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu Arg 
305 310 315 320 



1440 
1500 
1560 
1620 
1680 
1721 
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Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser 

325 330 335 

Leu Ser Ser Gin Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp 

340 345 350 

Gly Pro Pro Ser Thr Asn lie Pro His Pro Trp Trp Gly Phe Phe Leu 
355 360 365 

Trp Leu 
370 



<210> 135 

<211> 1869 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> modif ied_base 

<222> all "n" positions 

<223> n=a, c, g, or t 



<400> 135 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 

gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120 

cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 24 0 

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 

cttcagcatg gtacaccttg cacagggagg cgtcttcatc ccagccatgg gaggtgccct 360 

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 42 0 

gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 480 

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 54 0 

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 

tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 660 

tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720 

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 84 0 

tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900 

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020 

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080 

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140 

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 132 0 

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 1380 

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440 

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 

actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 
ggcggccgc 

<210> 136 
<211> 370 
<212> PRT 

<213> Homo sapiens 



1869 
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<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 



<400> 136 



Met 


lie 


Ser Leu Pro Gly 


Pro 


Leu 


Val Thr Asn 


Leu Xaa Arg 


Phe Leu 


1 






5 








10 








15 


Phe 


Leu 


Gly Leu Ser Ala 


Leu 


Ala 


Pro Pro Ser 


Arg Ala Gin Leu Gin 






20 










25 






30 




Leu 


His 


Leu Pro 


Ala 


Asn 


Arg 


Leu 


Gin Ala Val 


Glu Glu Gly Glu Ser 






35 








40 






45 






Gly 


Ala 


Ser Ala 


Trp Tyr 


Thr 


Leu 


His Arg Glu 


Val 


Ser 


Ser 


Ser Gin 




50 








55 






60 








Pro 


Trp 


Glu Val 


Pro 


Phe 


Val 


Met 


Trp Phe Phe 


Lys 


Gin 


Lys 


Glu Lys 


65 








70 






75 








80 


Glu 


Asp 


Gin Val 


Leu 
85 


Ser 


Tyr 


He 


Asn Gly Val 
90 


Thr 


Thr 


Ser 


Lys Pro 
95 


Gly 


Val 


Ser Leu Ala Tyr 


Ser 


Met 


Pro Ser Arg 


Asn 


Leu 


Ser 


Leu Arg 






100 










105 






110 




Val 


Glu 


Gly Leu Gin Glu 


Lys 


Asp 


Ser Gly Pro 


Tyr 


Ser 


Cys 


Ser Val 






115 








120 






125 






Asn 


Val 


Gin Asp Lys 


Gin 


Gly Lys 


Ser Arg Gly 


His 


Ser 


He 


Lys Thr 




130 








135 






140 








Leu 


Glu 


Leu Asn 


Val 


Leu 


Val 


Pro 


Pro Ala Pro 


Pro 


Ser 


Cys 


Arg Leu 


145 








150 






155 








160 


Gin 


Gly 


Val Pro 


His 


Val 


Gly Ala Asn Val Thr 


Leu 


Ser Cys 


Gin Ser 








165 








170 








175 


Pro 


Arg 


Ser Lys 


Pro 


Ala 


Val 


Gin Tyr Gin Trp 


Asp Arg 


Gin 


Leu Pro 






180 










185 






190 




Ser 


Phe 


Gin Thr 


Phe 


Phe 


Ala 


Pro 


Ala Leu Asp 


Val 


He Arg Gly Ser 






195 








200 






205 






Leu 


Ser 


Leu Thr 


Asn 


Leu 


Ser 


Ser 


Ser Met Ala 


Gly Val 


Tyr Val Cys 




210 








215 






220 








Lys 


Ala 


His Asn 


Glu 


Val 


Gly Thr Ala Gin Cys 


Asn 


Val 


Thr 


Leu Glu 


225 








230 






235 








240 


Val 


Ser 


Thr Gly 


Pro Gly 


Ala 


Ala 


Val Val Ala 


Glu 


Ala 


Val 


Val Gly 








245 








250 








255 


Thr 


Leu 


Val Gly Leu Gly 


Leu Leu Ala Gly Leu 


Val Leu Leu Tyr His 






260 










265 






270 




Arg 


Arg 


Gly Lys Ala Leu 


Glu 


Glu 


Pro Ala Asn 


Asp 


lie 


Lys 


Glu Asp 






275 








280 






285 






Ala 


He 


Ala Pro 


Arg 


Thr 


Leu 


Pro 


Trp Pro Lys 


Ser Ser Asp 


Thr lie 




2 90 








295 






300 








Ser 


Lys 


Asn Gly Thr 


Leu 


Ser 


Ser 


Val Thr Ser 


Ala Arg Ala Leu Arg 


305 








310 






315 








320 


Pro 


Pro 


His Gly Pro 


Pro 


Arg Pro Gly Ala Leu 


Thr 


Pro 


Thr 


Pro Ser 








325 








330 








335 


Leu 


Ser 


Ser Gin 


Ala 


Leu 


Pro 


Ser 


Pro Arg His 


Ala His Asp Arg Trp 






340 










345 






350 




Gly 


Pro 


Pro Ser 


Thr 


Asn 


He 


Pro 


His Pro Trp 


Trp Gly Phe 


Phe Leu 






355 








360 






365 







Trp Leu 
370 



<210> 137 
<211> 1869 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> modif ied_base 

<222> all "n H positions 

<223> n=a, c, g, or t 



<400> 137 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 

gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120 

cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 24 0 

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 3 00 

cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 360 

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 42 0 

gggtcacaac aagcaaacct ggagtatcct tggcctactc catgccctcc cggaacctgt 480 

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 54 0 

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 

tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 660 

tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720 

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 84 0 

tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900 

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 1020 

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 108 0 

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 1140 

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 1200 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320 

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 13 80 

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440 

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 

actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 
ggcggccgc 

<210> 138 
<211> 370 
<212> PRT 

<213> Homo sapiens 



1869 



<220> 

<221> SITE 

<222> (13) 

<223> Xaa=unknown amino acid 



<400> 138 
Met lie Ser Leu Pro Gly Pro Leu 

1 5 
Phe Leu Gly Leu Ser Ala Leu Ala 
20 



Val Thr Asn Leu Xaa Arg Phe Leu 

10 15 
Pro Pro Ser Arg Ala Gin Leu Gin 
25 30 
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Leu His 


Leu 


Pro 


Ala Asn Arg 


Leu 


Gin 


Ala 


Val 


Glu 


Glu 


Gly 


Glu Ser 




35 








40 










45 






Gly Ala 


Ser 


Ala 


Trp Tyr 


Thr 


Leu 


His 


Arg 


Glu 


Val 


Ser 


O A V 

ser 


ser Gin 


50 








55 










60 








Pro Trp 


Glu 


val 


Pro Phe 


Val 


Met 


Trp 


Phe 


Phe 


Lys 


Gin 


Lys 


Glu Lys 


65 






70 










75 








80 


Glu Asp 


Gin 


Val 


Leu Ser 
85 


Tyr 


He 


Asn 


Gly 
90 


Val 


Thr 


Thr 


Ser 


Lys Pro 
95 


Gly Val 


Ser 


Leu 


Val Tyr 


Ser 


Met 


Pro 


Ser 


Arg 


Asn 


Leu 


Ser 


Leu Arg 




100 








105 










110 




Val Glu 


Gly 
115 


Leu 


Gin Glu 


Lys 


Asp 
120 


Ser 


Gly 


Pro 


Tyr 


Ser 
125 


Cys 


Ser Val 


Asn Val 


Gin 


Asp 


Lys Gin Gly 


Lys 


Ser 


Arg 


Gly His 


Ser 


He 


Lys Thr 


130 








135 










140 








Leu Glu 


Leu 


Asn 


Val Leu 


Val 


Pro 


Pro 


Ala 


Pro 


Pro 


Ser 


Cys 


Arg He 


145 






150 










155 








160 


Gin Gly 


Val 


Pro 


His Val Gly 


Ala 


Asn 


Val 


Thr 


Leu 


Ser 


Cys 


Gin Ser 








165 








170 










175 


Pro Arg 


Ser 


Lys 


Pro Ala 


Val 


Gin 


Tyr 


Gin 


Trp Asp 


Arg 


Gin 


Leu Pro 






180 








185 










190 




Ser Phe 


Gin 
195 


Thr 


Phe Phe 


Ala 


Pro 
200 


Ala 


Leu 


Asp 


Val 


He 
205 


Arg 


Gly Ser 


Leu Ser 


Leu 


Thr 


Asn Leu 


Ser 


Ser 


Ser 


Met 


Ala 


Gly 


Val 


Tyr 


Val Cys 


210 








215 










220 








Lys Ala 


His 


Asn 


Glu Val 


Gly 


Thr 


Ala 


Gin 


Cys 


Asn 


Val 


Thr 


Leu Glu 


225 






230 










235 








240 


Val Ser 


Thr 


Gly 


Pro Gly Ala 


Ala 


Val 


Val 


Ala 


Glu 


Ala 


Val 


Val Gly 








245 








250 










255 


Thr Leu 


Val 


Gly 


Leu Gly Leu 


Leu 


Ala 


Gly 


Leu 


Val 


Leu 


Leu 


Tyr His 






260 








265 










270 




Arg Arg 


Gly 


Lys 


Ala Leu 


Glu 


Glu 


Pro 


Ala 


Asn Asp 


He 


Lys 


Glu Asp 




275 








280 










285 






Ala He 


Ala 


Pro 


Arg Thr 


Leu 


Pro 


Trp 


Pro 


Lys 


Ser 


Ser 


Asp 


Thr He 


290 








295 










300 








Ser Lys 


Asn 


Gly 


Thr Leu 


Ser 


Ser 


Val 


Thr 


Ser 


Ala 


Arg 


Ala 


Leu Arg 


305 






310 










315 








320 


Pro Pro 


His 


Gly 


Pro Pro 
325 


Arg 


Pro 


Gly 


Ala 
330 


Leu 


Thr 


Pro 


Thr 


Pro Ser 
335 


Leu Ser 


Ser 


Gin 
340 


Ala Leu 


Pro 


Ser 


Pro 
345 


Arg 


His 


Ala 


His 


Asp 
350 


Arg Trp 


Gly Pro 


Pro 
355 


Ser 


Thr Asn 


He 


Pro 
360 


His 


Pro 


Trp 


Trp 


Gly 
365 


Phe 


Phe Leu 



Trp Leu 
370 



<210> 139 

<211> 1869 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> modif ied_base 

<222> all "n M positions 

<223> n=a, c, g, or t 

<400> 139 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 60 
gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 120 
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cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 

ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240 

agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 

cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 360 

ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420 

gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 4 80 

ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 54 0 

tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac 600 

tggttcctcc agctcctcca tcctgccgta tccagggtgt gccccatgtg ggggcaaacg 660 

tgaccctgag ctgccagtct ccaaggagta agcccgctgt ccaataccag tgggatcggc 720 

agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 780 

gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840 

tgggcactgc ccaatgtaat gtgacgctgg aagtgagcac agggcctgga gctgcagtgg 900 

ttgctgaagc tgttgtgggt accctggttg gactggggtt gctggctggg ctggtcctct 960 

tgtaccaccg ccggggcaag gccctggagg agccagccaa tgatatcaag gaggatgcca 102 0 

ttgctccccg gaccctgccc tggcccaaga gctcagacac aatctccaag aatgggaccc 1080 

tttcctctgt cacctccgca cgagccctcc ggccacccca tggccctccc aggcctggtg 114 0 

cattgacccc cacgcccagt ctatccagcc aggccctgcc ctcaccaaga catgcccacg 12 00 

acagatgggg cccaccctca accaatatcc cccatccctg gtggggtttt ttcctttggc 1260 

tttgagccgc atgggtgctg ngcctgtgat ggngcctgcc cagagtcaag ctggctctct 1320 

ggtatgatga ccccaccact cattggctaa aggatttggg gtctctcctt cctataaggg 13 80 

tcacctctag cacagaggcc tgagtcatgg gaaagagtca cactcctgac ccttagtact 1440 

ctgcccccac ctctctttac tgtgggaaaa ccatctcagt aagacctaag tgtccaggag 1500 

acagaaggag aagaggaagt ggatctggaa ttgggaggag cctccaccca cccctgactc 1560 

ctccttatga agccagctgc tgaaattagc tactcaccaa gagtgagggg cagagacttc 1620 

cagtcactga gtctcccagg cccccttgat ctgtacccca cccctatcta acaccaccct 1680 

tggctcccac tccagctccc tgtattgata taacctgtca ggctggcttg gttaggtttt 1740 

actggggcag aggataggga atctcttatt aaaactaaca tgaaatatgt gttgttttca 1800 

tttgcaaatt taaataaaga tacataatgt ttgtatgaga taagaaaaaa aaaaaaaaag 1860 

ggcggccgc 1869 

<210> 140 
<211> 370 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (13) 

<223> Xaa=unknown amino acid 



<400> 140 



Met 


He 


Ser 


Leu 


Pro 


Gly 


Pro 


Leu 


Val 


Thr 


Asn 


Leu 


Xaa 


Arg 


Phe 


Leu 


1 








5 










10 










15 




Phe 


Leu 


Gly 


Leu 
20 


Ser 


Ala 


Leu 


Ala 


Pro 
25 


Pro 


Ser 


Arg 


Ala 


Gin 
30 


Leu 


Gin 


Leu 


His 


Leu 
35 


Pro 


Ala 


Asn 


Arg 


Leu 
40 


Gin 


Ala 


Val 


Glu 


Glu 
45 


Gly 


Glu 


Ser 


Gly 


Ala 
50 


Ser 


Ala 


Trp 


Tyr 


Thr 
55 


Leu 


His 


Arg 


Glu 


Val 
60 


Ser 


Ser 


Ser 


Gin 


Pro 


Trp 


Glu 


Val 


Pro 


Phe 


Val 


Met 


Trp 


Phe 


Phe 


Lys 


Gin 


Lys 


Glu. 


Lys 


65 










70 










75 










80 


Glu 


Asp 


Gin 


Val 


Leu 
85 


Ser 


Tyr 


He 


Asn 


Gly 
90 


Val 


Thr 


Thr 


Ser 


Lys 
95 


Pro 


Gly 


Val 


Ser 


Leu 
100 


Val 


Tyr 


Ser 


Met 


Pro 
105 


Ser 


Arg 


Asn 


Leu 


Ser 
110 


Leu 


Arg 


Val 


Glu 


Gly 
115 


Leu 


Gin 


Glu 


Lys 


Asp 
120 


Ser 


Gly 


Pro 


Tyr 


Ser 
125 


Cys 


Ser 


Val 
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Asn Val Gin Asp Lys Gin Gly Lys Ser Arg Gly His Ser He Lys Thr 

130 135 140 

Leu Glu Leu Asn Val Leu Val Pro Pro Ala Pro Pro Ser Cys Arg Leu 
145 150 155 160 

Gin Gly Val Pro His Val Gly Ala Asn Val Thr Leu Ser Cys Gin Ser 

165 170 175 

Pro Arg Ser Lys Pro Val Val Gin Tyr Gin Trp Asp Arg Gin Leu Pro 

180 185 190 

Ser Phe Gin Thr Phe Phe Ala Pro Ala Leu Asp Val He Arg Gly Ser 

195 200 205 

Leu Ser Leu Thr Asn Leu Ser Ser Ser Met Ala Gly Val Tyr Val Cys 

210 215 220 

Lys Ala His Asn Glu Val Gly Thr Ala Gin Cys Asn Val Thr Leu Glu 
225 230 235 240 

Val Ser Thr Gly Pro Gly Ala Ala Val Val Ala Glu Ala Val Val Gly 

245 250 255 

Thr Leu Val Gly Leu Gly Leu Leu Ala Gly Leu Val Leu Leu Tyr His 

260 265 270 

Ara Ara Gly Lys Ala Leu Glu Glu Pro Ala Asn Asp He Lys Glu Asp 

27 5 280 285 

Ala He Ala Pro Arg Thr Leu Pro Trp Pro Lys Ser Ser Asp Thr He 

290 295 300 

Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg Ala Leu Arg 
305 310 315 320 

Pro Pro His Gly Pro Pro Arg Pro Gly Ala Leu Thr Pro Thr Pro Ser 

325 330 335 

Leu Ser Ser Gin Ala Leu Pro Ser Pro Arg His Ala His Asp Arg Trp 

340 345 350 

Gly Pro Pro Ser Thr Asn He Pro His Pro Trp Trp Gly Phe Phe Leu 
355 360 365 

Trp Leu 
370 

<210> 141 
<211> 1869 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 141 

gtcgacccac gcgtncntcc agcgtncgga gccgccctgg gtgtcagcgg ctcggctccc 
gcgcacgctc cggccgtcgc gcagcctcgg cacctgcagg tccgtgcgtc ccgcggctgg 
cgcccctgac tccgtcccgg ccagggaggg ccatgatttc cctcccgggg cccctggtga 180 
ccaacttgnt gcggtttttg ttcctggggc tgagtgccct cgcgcccccc tcgcgggccc 240 
agctgcaact gcacttgccc gccaaccggt tgcaggcggt ggaggagggg gaaagtggtg 300 
cttcagcatg gtacaccttg cacagggagg tgtcttcatc ccagccatgg gaggtgccct 3 60 
ttgtgatgtg gttcttcaaa cagaaagaaa aggaggatca ggtgttgtcc tacatcaatg 420 
gggtcacaac aagcaaacct ggagtatcct tggtctactc catgccctcc cggaacctgt 480 
ccctgcgggt ggagggtctc caggagaaag actctggccc ctacagctgc tccgtgaatg 540 
tgcaagacaa acaaggcaaa tctaggggcc acagcatcaa aaccttagaa ctcaatgtac «™ 
tggttcctcc agctcctcca tcctgccgtc tccagggtgt gccccatgtg ggggcaaacg 
tgaccctgag ctgccagtct ccaaggagta agcccgttgt ccaataccag tgggatcggc 
agcttccatc cttccagact ttctttgcac cagcattaga tgtcatccgt gggtctttaa 
gcctcaccaa cctttcgtct tccatggctg gagtctatgt ctgcaaggcc cacaatgagg 840 



60 
120 



600 
660 
720 
780 
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tgggcactgc 
ttgctgaagc 
tgtaccaccg 
ttgctccccg 
tttcctctgt 
cattgacccc 
acagatgggg 
tttgagccgc 
ggtatgatga 
tcacctctag 
ctgcccccac 
acagaaggag 
ctccttatga 
cagtcactga 
tggctcccac 
actggggcag 
tttgcaaatt 
ggcggccgc 



ccaatgtaat 
tgttgtgggt 
ccggggcaag 
gaccctgccc 
cacctccgca 
cacgcccagt 
cccaccctca 
atgggtgctg 
ccccaccact 
cacagaggcc 
ctctctttac 
aagaggaagt 
agccagctgc 
gtctcccagg 
tccagctccc 
aggataggga 
taaataaaga 



gtgacgctgg 
accctggttg 
gccctggagg 
tggcccaaga 
cgagccctcc 
ctatccagcc 
accaatatcc 
ngcctgtgat 
cattggctaa 
tgagtcatgg 
tgtgggaaaa 
ggatctggaa 
tgaaattagc 
cccccttgat 
tgtattgata 
atctcttatt 
tacataatgt 



aagtgagcac 
g a ctggggtt 
agccagccaa 
gctcagacac 
ggccacccca 
aggccctgcc 
cccatccctg 
ggngcctgcc 
a gg a tttggg 
gaaagagtca 
ccatctcagt 
ttgggaggag 
tactcaccaa 
ctgtacccca 
taacctgtca 
aaaactaaca 
ttgtatgaga 



agggcctgga 
gctggctggg 
tgatatcaag 
aatctccaag 
tggccctccc 
ctcaccaaga 
gtggggtttt 
cagagtcaag 
gtctctcctt 
cactcctgac 
aagacctaag 
cctccaccca 
gagtgagggg 
cccctatcta 
ggctggcttg 
tgaaatatgt 
taagaaaaaa 



gctgcagtgg 
ctggtcctct 
gaggatgcca 
aatgggaccc 
aggcctggtg 
catgcccacg 
ttcctttggc 
ctggctctct 
cctataaggg 
ccttagtact 
tgtccaggag 
cccctgactc 
cagagacttc 
acaccaccct 
gttaggtttt 
gttgttttca 
aaaaaaaaag 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1869 



<210> 142 

<211> 394 

<212> PRT 

<213> Mus musculus 



<400> 142 



Met 


He 


Leu 


Gin 


Ala 


Gly 


Thr 


Pro 


Glu 


Thr 


Ser 


Leu 


Leu 


Arg 


Val 


Leu 


1 








5 










10 










15 




Phe 


Leu 


Gly 


Leu 


Ser 


Thr 


Leu 


Ala 


Ala 


Phe 


Ser 


Arg 


Ala 


Gin 


Met 


Glu 








20 










25 










30 






Leu 


His 


Val 


Pro 


Pro 


Gly 


Leu 


Asn 


Lys 


Leu 


Glu 


Ala 


Val 


Glu 


Gly 


Glu 






35 










40 










45 








Glu 


Val 


Val 


Leu 


Pro 


Ala 


Trp 


Tyr 


Thr 


Met 


Ala 


Arg 


Glu 


Glu 


Ser 


Trp 




50 










55 










60 










Ser 


His 


Pro 


Arg 


Glu 


Val 


Pro 


He 


Met 


He 


Trp 


Phe 


Leu 


Glu 


Gin 


Glu 


65 










70 










75 










80 


Gly 


Lys 


Glu 


Pro 


Asn 


Gin 


Val 


Leu 


Ser 


Tyr 


He 


Asn 


Gly 


Val 


Met 


Thr 










85 










90 










95 




Asn 


Lys 


Pro 


Gly 


Thr 


Ala 


Leu 


Val 


His 


Ser 


He 


Ser 


Ser 


Arg 


Asn 


Val 








100 










105 










110 






Ser 


Leu 


Arg 


Leu 


Gly 


Ala 


Leu 


Gin 


Glu 


Gly 


Asp 


Ser 


Gly 


Thr 


Tyr 


Arg 






115 










120 










125 








Cys 


Ser 


Val 


Asn 


Val 


Gin 


Asn 


Asp 


Glu 


Gly 


Lys 


Ser 


He 


Gly 


His 


Ser 




130 










135 










140 










He 


Lys 


Ser 


He 


Glu 


Leu 


Lys 


Val 


Leu 


Val 


Pro 


Pro 


Ala 


Pro 


Pro 


Ser 


145 










150 










155 










160 


Cys 


Ser 


Leu 


Gin 


Gly 


Val 


Pro 


Tyr 


Val 


Gly 


Thr 


Asn 


Val 


Thr 


Leu 


Asn 










165 










170 










175 




Cys 


Lys 


Ser 


Pro 


Arg 


Ser 


Lys 


Pro 


Thr 


Ala 


Gin 


Tyr 


Gin 


Trp 


Glu 


Arg 








180 










185 










190 






Leu 


Ala 


Pro 


Ser 


Ser 


Gin 


Val 


Phe 


Phe 


Gly 


Pro 


Ala 


Leu 


Asp 


Ala 


Val 






195 










200 










205 








Arg 


Gly 


Ser 


Leu 


Lys 


Leu 


Thr 


Asn 


Leu 


Ser 


He 


Ala 


Met 


Ser 


Gly 


Val 




210 










215 










220 










Tyr 


Val 


Cys 


Lys 


Ala 


Gin 


Asn 


Arg 


Val 


Gly 


Phe 


Ala 


Lys 


Cys 


Asn 


Val 


225 










230 










235 










240 


Thr 


Leu 


Asp 


val 


Met 


Thr 


Gly 


Ser 


Lys 


Ala 


Ala 


Val 


Val 


Ala 


Gly 


Ala 










245 










250 










255 




Val 


Val 


Gly 


Thr 


Phe 


Val 


Gly 


Leu 


Val 


Leu 


He 


Ala 


Gly 


Leu 


Val 


Leu 
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260 








265 










270 






Leu 


Tyr 


Gin 
275 


Arg Arg 


Ser 


Lys 


Thr 
280 


Leu 


Glu 


Glu 


Leu 


Ala 
285 


Asn 


Asp 


T 1 A 

lie 


Lys 


Glu 
290 


Asp 


Ala He 


Ala 


Pro 
295 


Arg 


Thr 


Leu 


Pro 


Trp 
300 


Thr 


Lys 


Gly 


Ser 


Asp 


Thr 


He 


Ser Lys 


Asn 


Gly 


Thr 


Leu 


Ser 


Ser 


Val 


Thr 


Ser 


Ala 


Arg 


305 








310 










315 










320 


Ala 


Leu 


Arg 


Pro Pro Lys 


Ala 


Ala 


Pro 


Pro 


Arg 


Pro 


Gly 


Thr 


Phe 


Thr 








325 










330 










335 




Pro 


Thr 


Pro 


Ser Val 
340 


Ser 


Ser 


Gin 


Ala 
345 


Leu 


Ser 


Ser 


Pro 


Arg 
350 


Leu 


Pro 


Arg 


Val 


Asp 
355 


Glu Pro 


Pro 


Pro 


Gin 
360 


Ala 


Val 


Ser 


Leu 


Thr 
365 


Pro 


Gly 


Gly 


Val 


Ser 
370 


Ser 


Ser Ala 


Leu 


Ser 
375 


Arg 


Met 


Gly 


Ala 


Val 
380 


Pro 


Val 


Met 


Val 


Pro 


Ala 


Gin 


Ser Gin 


Ala 


Gly 


Ser 


Leu 


Val 














385 








390 























<210> 143 
<211> 1846 
<212> DNA 

<213> Mus musculus 
<400> 143 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 24 0 

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 3 00 

ccacccccgg gaggtgccca tcatgatctg gttcttggaa caagaaggga aggaaccaaa 3 60 

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 4 80 

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 54 0 

caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840 

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 102 0 

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 

caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc * 114 0 

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 12 00 

cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320 

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620 

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 174 0 

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 184 6 

<210> 144 
<211> 394 
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<212> PRT 

<213> Mus musculus 
<400> 144 

Met He Leu Gin Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu 

15 10 15 

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gin Met Glu 

20 25 30 

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu 

35 40 45 

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp 

50 55 60 

Ser His Pro Arg Glu Val Pro He Leu He Trp Phe Leu Glu Gin Glu 
65 ^ 70 75 80 

Gly Lys Glu Pro Asn Gin Val Leu Ser Tyr He Asn Gly Val Met Thr 

85 90 95 

Asn Lys Pro Gly Thr Ala Leu Val His Ser He Ser Ser Arg Asn Val 

100 105 110 

Ser Leu Arg Leu Gly Ala Leu Gin Glu Gly Asp Ser Gly Thr Tyr Arg 

115 120 125 

Cys Ser Val Asn Val Gin Asn Asp Glu Gly Lys Ser He Gly His Ser 

130 135 140 

He Lys Ser He Glu Leu Lys Ala Leu Val Pro Pro Ala Pro Pro Ser 
145 150 155 160 

Cys Ser Leu Gin Gly Val Pro Tyr Val Gly Thr Asn Val Thr Leu Asn 

165 170 175 

Cys Lys Ser Pro Arg Ser Lys Pro Thr Ala Gin Tyr Gin Trp Glu Arg 

180 185 190 

Leu Ala Pro Ser Ser Gin Val Phe Phe Gly Pro Ala Leu Asp Ala Val 

195 200 205 

Arg Gly Ser Leu Lys Leu Thr Asn Leu Ser He Ala Met Ser Gly Val 

210 215 220 

Tyr Val Cys Lys Ala Gin Asn Arg Val Gly Phe Ala Lys Cys Asn Val 
225 230 235 240 

Thr Leu Asp Val Met Thr Gly Ser Lys Ala Ala Val Val Ala Gly Ala 

245 250 255 

Val Val Gly Thr Phe Val Gly Leu Val Leu He Ala Gly Leu Val Leu 

260 265 270 

Leu Tyr Gin Arg Arg Ser Lys Thr Leu Glu Glu Leu Ala Asn Asp He 

275 280 285 

Lys Glu Asp Ala He Ala Pro Arg Thr Leu Pro Trp Thr Lys Gly Ser 

290 295 300 

Asp Thr He Ser Lys Asn Gly Thr Leu Ser Ser Val Thr Ser Ala Arg 
305 310 315 320 

Ala Leu Arg Pro Pro Lys Ala Ala Pro Pro Arg Pro Gly Thr Phe Thr 

325 330 335 

Pro Thr Pro Ser Val Ser Ser Gin Ala Leu Ser Ser Pro Arg Leu Pro 

340 345 350 

Arg Val Asp Glu Pro Pro Pro Gin Ala Val Ser Leu Thr Pro Gly Gly 

355 360 365 

Val Ser Ser Ser Ala Leu Ser Arg Met Gly Ala Val Pro Val Met Val 

370 375 380 

Pro Ala Gin Ser Gin Ala Gly Ser Leu Val 
385 390 
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<211> 1846 
<212> DNA 
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213> Mus musculus 
400> 145 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240 

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300 

ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360 

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480 

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 540 

caaaagcata gagctcaaag cgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgctcagtac cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 840 

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020 

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 

caaggctgct cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140 

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200 

cccaggtggg gtttcttctt ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 12 60 

tgcacagagt caggctgggt ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320 

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gaagatcggc taccctccaa gggctctgga ggagactgcc agtcagtgat gcccctggct 1620 

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740 

gacattttaa aatctgactt gaaattgatg ttt'ttgtttt tattttgcaa atttcaataa 1800 

agatacatcg catttgcatg gaaaaaaaaa aaaaaagggc ggccgc 184 6 

<210> 146 
<211> 394 
<212> PRT 
<213> Mus musculus 

<400> 146 

Met He Leu Gin Ala Gly Thr Pro Glu Thr Ser Leu Leu Arg Val Leu 

15 10 15 

Phe Leu Gly Leu Ser Thr Leu Ala Ala Phe Ser Arg Ala Gin Met Glu 

20 25 30 

Leu His Val Pro Pro Gly Leu Asn Lys Leu Glu Ala Val Glu Gly Glu 

35 40 45 

Glu Val Val Leu Pro Ala Trp Tyr Thr Met Ala Arg Glu Glu Ser Trp 

50 55 60 

Ser His Pro Arg Glu Val Pro He Leu He Trp Phe Leu Glu Gin Glu 



Asn Lys Pro Gly Thr Ala Leu Val His Ser He Ser Ser Arg Asn Val 

100 105 HO 

Ser Leu Arg Leu Gly Ala Leu Gin Glu Gly Asp Ser Gly Thr Tyr Arg 

115 120 125 

Cys Ser Val Asn Val Gin Asn Asp Glu Gly Lys Ser lie Gly His Ser 
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130 










135 










140 










lie 


Lys 


Ser 


He 


Glu 


Leu 


Lys 


Val 


Leu 


Val 


Pro 


Pro 


Ala 


Pro 


Pro 


Ser 


145 










150 










155 










160 


Cys 


Ser 


Leu 


Gin 


Gly Val 


Pro 


Tyr 


Val 


Gly 


Thr 


Asn 


Val 


Thr 


Leu 


Asn 










165 










170 










175 




Cys 


Lys 


Ser 


Pro 


Arg 


Ser 


Lys 


Pro 


Thr 


Ala 


Gin 


Tyr 


Gin 


Trp 


Glu 


Arg 








180 










185 










190 






Leu 


Val 


Pro 


Ser 


Ser 


Gin 


Val 


Phe 


Phe 


Gly 


Pro 


Ala 


Leu 


Asp 


Ala 


Val 






195 










200 










205 








Arg 


Gly 


Ser 


Leu 


Lys 


Leu 


Thr 


Asn 


Leu 


Ser 


He 


Ala 


Met 


Ser 


Gly 


Val 




210 










215 










220 










Tyr 


Val 


Cys 


Lys 


Ala 


Gin 


Asn 


Arg 


Val 


Gly 


Phe 


Ala 


Lys 


Cys 


Asn 


Val 


225 










230 










235 










240 


Thr 


Leu 


Asp 


val 


Met 


Thr 


Gly 


Ser 


Lys 


Ala 


Ala 


Val 


Val 


Ala 


Gly 


Ala 










245 










250 










255 




Val 


Val 


Gly Thr 


Phe 


Val 


Gly 


Leu 


Val 


Leu 


He 


Ala 


Gly 


Leu 


Val 


Leu 








260 










265 










270 






Leu 


Tyr 


Gin Arg 


Arg 


Ser 


Lys 


Thr 


Leu 


Glu 


Glu 


Leu 


Ala 


Asn 


Asp 


lie 






275 










280 










285 








Lys 


Glu 


Asp Ala 


He 


Ala 


Pro 


Arg 


Thr 


Leu 


Pro 


Trp 


Thr 


Lys 


Gly 


Ser 




290 










295 










300 










Asp 


Thr 


He 


Ser 


Lys 


Asn 


Gly 


Thr 


Leu 


Ser 


Ser 


Val 


Thr 


Ser 


Ala 


Arg 


305 










310 










315 










320 


Ala 


Leu 


Arg 


Pro 


Pro Lys 


Ala 


Ala 


Pro 


Pro 


Arg 


Pro 


Gly 


Thr 


Phe 


Thr 










325 










330 










335 




Pro 


Thr 


Pro 


Ser 


Val 


Ser 


Ser 


Gin 


Ala 


Leu 


Ser 


Ser 


Pro 


Arg 


Leu 


Pro 








340 










345 










350 






Arg 


Val 


Asp 


Glu 


Pro 


Pro 


Pro 


Gin 


Ala 


Val 


Ser 


Leu 


Thr 


Pro 


Gly 


Gly 






355 










360 










365 








Val 


Ser 


Ser 


Ser 


Ala 


Leu 


Ser 


Arg 


Met 


Gly 


Ala 


Val 


Pro 


Val 


Met 


Val 




370 










375 










380 










Pro 


Ala 


Gin 


Ser 


Gin 


Ala 


Gly 


Ser 


Leu 


Val 















385 390 



<210> 147 
<211> 1846 
<212> DNA 

<213> Mus musculus 
<400> 147 

gtcgacccac gcgtccggtg cacattcggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc cgggccaagg agggccatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 240 

agagggagaa gaagtggtgc tccccgcctg gtacacgatg gcacgggagg agtcgtggtc 300 

ccacccccgg gaggtgccca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360 

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 42 0 

ctctatctct tcacggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 480 

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 54 0 

caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtcgggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgctcagtac cagtgggaga ggctggtccc atcctcccag gtcttctttg gaccagcctt 720 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgccatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 84 0 

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020 

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 
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caaggctgct cctccaagac 

cctgtcctca ccaagactgc 

cccaggtggg gtttcttctt 

tgcacagagt caggctgggt 

tatctgacct ttctgtaaag 

ccacattcta gacctccagt 

tcagtaagac taaaatctgg 

ttgggagtga ggaggcttca 

gaagatcggc taccctccaa 

ctgtgatctg tacaacaccc 

tattaatata acctgtcctg 

gacattttaa aatctgactt 

agatacatcg catttgcatg 

<210> 148 
<211> 394 
<212> PRT 

<213> Mus musculus 



<400> 148 



Met 


He 


Leu 


Gin 


Ala 


Gly 


Thr 


Pro 


Glu 


Thr 


Ser 


Leu 


Leu Arg Val Leu 


1 








5 










10 






15 


Phe 


Leu 


Gly 


Leu 


Ser 


Thr 


Leu 


Ala 


Ala 


Phe 


Ser 


Arg 


Ala Gin Met Glu 








20 










25 








30 


Leu 


His 


Val 


Pro 


Pro 


Gly 


Leu 


Asn 


Lys 


Leu 


Glu 


Ala 


Val Glu Gly Glu 






35 










40 










45 


Glu 


Val 


Val 


Leu 


Pro 


Ala 


Trp 


Tyr 


Thr 


Met 


Ala 


Arg 


Glu Glu Ser Trp 




50 










55 










60 




Ser 


His 


Pro 


Arg 


Glu 


Val 


Pro 


He 


Leu 


He 


Trp 


Phe 


Leu Glu Gin Glu 


65 










70 










75 




80 


Gly 


Lys 


Glu 


Pro 


Asn 


Gin 


Val 


Leu 


Ser 


Tyr 


He 


Asn 


Gly Val Met Thr 










85 










90 






95 


Asn 


Lys 


Pro 


Gly 


Thr 


Ala 


Leu 


Val 


His 


Ser 


He 


Ser 


Ser Arg Asn Val 








100 










105 








110 


Ser 


Leu 


Arg 


Leu 


Gly 


Ala 


Leu 


Gin 


Glu 


Gly 


Asp 


Ser 


Gly Thr Tyr Arg 






115 










120 










125 


Cys 


Ser 


Val 


Asn 


Val 


Gin 


Asn 


Asp 


Glu 


Gly 


Lys 


Ser 


He Gly His Ser 




130 










135 










140 




He 


Lys 


Ser 


He 


Glu 


Leu 


Lys 


Val 


Leu 


Val 


Pro 


Pro 


Ala Pro Pro Ser 


145 










150 










155 




160 


Cys 


Ser 


Leu 


Gin 


Gly 


Val 


Pro 


Tyr 


Val 


Gly 


Thr 


Asn 


Val Thr Leu Asn 










165 










170 






175 


Cys 


Lys 


Ser 


Pro 


Arg 


Ser 


Lys 


Pro 


Thr 


Ala 


Gin Tyr 


Gin Trp Glu Arg 








180 










185 








190 


Leu 


Ala 


Pro 


Ser 


Ser 


Gin 


Val 


Phe 


Phe 


Gly 


Pro 


Ala 


Leu Asp Ala Val 






195 










200 










205 


Arg 


Gly 


Ser 


Leu 


Lys 


Leu 


Thr 


Asn 


Leu 


Ser 


He 


Ala 


Met Ser Gly Val 




210 










215 










220 




Tyr 


Val 


Cys 


Lys 


Ala 


Gin 


Asn 


Arg 


Val 


Gly 


Phe 


Ala 


Lys Cys Asn Val 


225 










230 










235 




240 


Thr 


Leu 


Asp 


Val 


Met 


Thr 


Gly 


Ser 


Lys 


Ala 


Ala 


Val 


Val Ala Gly Ala 










245 










250 






255 


Val 


Val 


Gly 


Thr 


Phe 


Val 


Gly 


Leu 


Val 


Leu 


He 


Ala 


Gly Leu Val Leu 








260 










265 








270 


Leu 


Tyr 


Gin 


Arg 


Arg 


Ser 


Lys 


Thr 


Leu 


Glu 


Glu 


Leu 


Ala Asn Asp He 






275 










280 










285 


Lys 


Glu 


Asp 


Ala 


He 


Ala 


Pro 


Arg 


Thr 


Leu 


Pro Trp 


Thr Lys Gly Ser 




290 










295 










300 





ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140 

ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200 

ctgctctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

ctcttgtgtg atagcccagg cactcattag ctacatctgg 1320 

gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

cctttgctcc tacctccttc tattgttgga atactgggcc 1440 

gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

cttcctccct gcttctccct gaagccagat gaatgctgcg 1560 

gggctctgga ggagactgcc agtcagtgat gcccctggct 1620 

ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 1740 

gaaattgatg tttttgtttt tattttgcaa atttcaataa 1800 

gaaaaaaaaa aaaaaagggc ggccgc 184 6 
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Asp 


inr 


lie 


Ser 


Lys 


Asn 


uiy 


inr 


Leu 


ber 


Ser 


Val 


Thr 


Ser 


Ala 


Arg 


.JUS 










310 










315 










320 


Ala 


Leu 


Arg 


Pro Pro Lys 


Ala 


Ala 


Pro 


Pro 


Arg 


Pro Gly Thr 


Phe 


Thr 










325 










330 










335 




Pro 


Thr 


Pro 


Ser 


Val 


Ser 


Ser 


Gin 


Ala 


Leu 


Ser 


Ser 


Pro 


Arg 


Leu 


Pro 








340 










345 










350 






Arg 


val 


Asp 


Glu 


Pro 


Pro 


Pro 


Gin 


Ala 


Val 


Ser 


Leu 


Thr 


Pro 


Gly Gly 






355 










360 










365 








Val 


Ser 


Ser 


Ser 


Val 


Leu 


Ser 


Arg 


Met 


Gly 


Ala 


Val 


Pro 


Val 


Met 


Val 




370 










375 










380 










Pro 


Ala 


Gin 


Ser 


Gin 


Ala 


Gly 


Ser 


Leu 


Val 














385 










390 























:210> 149 
:211> 1846 
;212> DNA 

:213> Mus musculus 
:400> 149 

gtcgacccac gcgtccggtg cacatteggg ttgccgccgc tcacccacaa cacctgtaga 60 

caccgtgtgt ccaactctcc ctgagtactc egggecaagg agggecatga ttcttcaggc 120 

tggaaccccc gagaccagct tgctgcgggt tttgttcctg ggactgagta cccttgctgc 180 

cttctcccga gctcagatgg agttgcacgt gcccccgggc ctcaacaaat tggaagcggt 24 0 

agagggagaa gaagtggtgc tccccgcctg gtacacgatg geaegggagg agtcgtggtc 300 

ccacccccgg gaggtgecca tcctgatctg gttcttggaa caagaaggga aggaaccaaa 360 

ccaggtgttg tcttacatta atggagtcat gacaaataaa cctggaacag ccctggtcca 420 

ctctatctct teaeggaatg tgtccctgcg cctgggggca ctccaggagg gagactctgg 4 80 

gacttaccgc tgttctgtca atgtgcagaa tgatgaaggc aaaagtatag gccacagcat 54 0 

caaaagcata gagctcaaag tgctggttcc tccagctcct ccatcctgta gtttacaggg 600 

tgtaccctat gtegggacca atgtgaccct gaactgcaag tccccaagga gtaaacctac 660 

tgetcagtae cagtgggaga ggctggcccc atcctcccag gtcttctttg gaccagcctt 720 

agatgctgtt cgtggatctt taaagctcac taacctttcc attgecatgt ctggagtcta 780 

tgtctgcaag gctcaaaaca gagtgggctt tgccaagtgc aacgtgacct tggacgtgat 84 0 

gacagggtcc aaggctgcag tggtcgctgg agcagttgtg ggcacttttg ttgggttggt 900 

gctgatagct gggctggtcc tgttgtacca gcgccggagc aagaccttgg aagagctggc 960 

caatgatatc aaggaagatg ccattgctcc ccggaccttg ccttggacca aaggctcaga 1020 

cacaatctcc aagaatggga cactttcttc ggtcacctca gcacgagctc tgcggccacc 1080 

caaggctget cctccaagac ctggcacatt tactcccaca cccagtgtct ctagccaggc 1140 

cctgtcctca ccaagactgc ccagggtaga tgaaccccca cctcaggcag tgtccctgac 1200 

cccaggtggg gtttcttctt ctgttctgag ccgcatgggt gctgtgcctg tgatggtgcc 1260 

tgcacagagt caggctgggt ctcttgtgtg atageccagg cactcattag ctacatctgg 132 0 

tatctgacct ttctgtaaag gtctccttgt ggcacagagg actcaatctt gggaggatgc 1380 

ccacattcta gacctccagt cctttgctcc tacctccttc tattgttgga atactgggee 1440 

tcagtaagac taaaatctgg gtcaaaggac aaaaggagga aatggacctg aggtaggggg 1500 

ttgggagtga ggaggcttca cttcctccct gcttctccct gaagecagat gaatgctgcg 1560 

gaagategge taccctccaa gggctctgga ggagactgee agtcagtgat gcccctggct 1620 

ctgtgatctg tacaacaccc ttatctaatg ctgtcctttg ccgttcgctc catctccctg 1680 

tattaatata acctgtcctg ctggcttggc tgggttttgt tgtagcaggg ggataggaaa 174 0 

gacattttaa aatctgactt gaaattgatg tttttgtttt tattttgeaa atttcaataa 1800 

agatacatcg catttgeatg gaaaaaaaaa aaaaaagggc ggcege 1846 

:210> 150 
:211> 245 
:212> PRT 

:213> Homo sapiens 
:400> 150 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
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1 








5 










10 










15 




Ala 


Pro 


Ser 


Pro 


Trp 


Thr 


Leu 


Ser 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Pro 








20 










25 










30 






Ser 


Pro 


Gly 


Ala 


His 


Gly 


Glu 


Leu 


Cys 


Arg 


Pro 


Phe 


Gly 


Glu 


Asp 


Asn 






35 










40 










45 








Ser 


He 


Pro 


Glu 


Ser 


Cys 


Pro 


Asp 


Phe 


Cys 


Cys 


Gly 


Ser 


Cys 


Ser 


Ser 




50 










55 










60 










Gin 


Tyr 


Cys 


Cys 


Ser 


Asp 


Val 


Leu 


Lys 


Lys 


He 


Gin 


Trp 


Asn 


Glu 


Glu 


65 










70 










75 










80 


Met 


Cys 


Pro 


Glu 


Pro 


Glu 


Ser 


Ser 


Arg 


Phe 


Ser 


Ala 


His 


Pro 


Glu 


Thr 








85 










90 










95 




Pro 


Glu 


Gin 


Leu Gly 


Ser 


Val 


Leu 


Lys 


Tyr 


Gin 


Ser 


Ser 


Leu 


Asp 


Ser 








100 










105 










110 






Asp 


Asn 


Met 


Pro Gly 


Phe 


Gly 


Ala 


Thr 


Val 


Ala 


He 


Gly 


Leu 


Thr 


Val 




115 










120 










125 








Phe 


Val 


Val 


Phe 


He 


Ala 


Thr 


He 


He 


Val 


Cys 


Phe 


Thr 


Cys 


Ser 


Cys 




130 










135 










140 










Cys 


Cys 


Leu 


Tyr 


Lys 


Met 


Cys 


Cys 


Arg 


Pro 


Arg 


Pro 


Val 


Val 


Ser 


Asn 


145 










150 










155 










160 


Thr 


Thr 


Thr 


Thr 


Thr 


Val 


Val 


His 


Thr 


Ala 


Tyr 


Pro 


Gin 


Pro 


Gin 


Pro 










165 










170 










175 




Val 


Ala 


Pro 


Ser 


Tyr 


Pro 


Gly 


Pro 


Thr 


Tyr 


Gin 


Gly 


Tyr 


His 


Pro 


Met 








180 










185 










190 






Pro 


Pro 


Gin 


Pro Gly 


Met 


Pro 


Ala 


Ala 


Pro 


Tyr 


Pro 


Thr 


Gin 


Tyr 


Pro 






195 










200 










205 








Pro 


Pro 


Tyr 


Leu 


Ala 


Gin 


Pro 


Thr 


Gly 


Pro 


Pro 


Ala 


Tyr 


His 


Glu 


Thr 




210 










215 










220 










Leu 


Ala 


Gly 


Ala 


Ser 


Gin 


Pro 


Pro 


Tyr 


Asn 


Pro 


Ala 


Tyr 


Met 


Asp 


Pro 


225 










230 










235 










240 


Pro 


Lys 


Ala 


Val 


Pro 

























245 



<210> 151 
<211> 1801 
<212> DNA 

<213> Homo sapiens 
<400> 151 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 

cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120 

tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 

gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240 

tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 

ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360 

gctgggttca gtgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420 

agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 4 80 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540 

caacaccaca actactaccg tggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 

cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 

agcagcaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 

agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 

tcccccaaag gcagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840 

catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 

ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 114 0 

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200 
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cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 12 60 

tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 13 20 

acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380 

actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440 

tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 

tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 

acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620 

gggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680 

tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 

ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

a 1801 

<210> 152 
<211> 245 
<212> PRT 
<213> Homo sapiens 



<400> 152 



Met 


Arg 


Leu 


Phe 


Val 


Arg 


Pro 


Ser 


Val 


Arg 


Pro 


Ala 


Met 


Ala 


Ala 


Pro 


1 






5 










10 










15 




Ala 


Pro 


Ser 


Pro 
20 


Trp 


Thr 


Leu 


Ser 


Leu 
25 


Leu 


Leu 


Leu 


Leu 


Leu 
30 


Leu 


Pro 


Ser 


Pro 


Gly Ala 


His 


Gly 


Glu 


Leu 


Cys 


Arg 


Pro 


Phe 


Gly 


Glu 


Asp 


Asn 






35 










40 










45 








Ser 


He 
50 


Pro 


Glu 


Ser 


Cys 


Pro 
55 


Asp 


Phe 


Cys 


Cys 


Gly 
60 


Ser 


Cys 


Ser 


Ser 


Gin 


Tyr 


Cys 


Cys 


Ser 


Asp 


Val 


Leu 


Lys 


Lys 


He 


Gin 


Trp 


Asn 


Glu 


Glu 


65 










70 










75 










80 


Met 


Cys 


Pro 


Glu 


Pro 


Glu 


Ser 


Ser 


Arg 


Phe 


Ser 


Ala 


His 


Pro 


Glu 


Thr 








85 










90 










95 




Pro 


Glu 


Gin 


Leu 
100 


Gly 


Ser 


Ala 


Leu 


Lys 
105 


Tyr 


Gin 


Ser 


Ser 


Leu 
110 


Asp 


Ser 


Asp 


Asn 


Met 
115 


Pro 


Gly 


Phe 


Gly 


Ala 
120 


Thr 


Val 


Ala 


lie 


Gly 
125 


Leu 


Thr 


Val 


Phe 


Val 
130 


Val 


Phe 


He 


Ala 


Thr 
135 


He 


He 


Val 


Cys 


Phe 
140 


Thr 


Cys 


Ser 


Cys 


Cys 


Cys 


Leu Tyr 


Lys 


Met 


Cys 


Cys 


Arg 


Pro 


Arg 


Pro 


Val 


Val 


Ser 


Asn 


145 










150 










155 










160 


Thr 


Thr 


Thr 


Thr 


Thr 
165 


Ala 


Val 


His 


Thr 


Ala 
170 


Tyr 


Pro 


Gin 


Pro 


Gin 
175 


Pro 


Val 


Ala 


Pro 


Ser 
180 


Tyr 


Pro 


Gly 


Pro 


Thr 
185 


Tyr 


Gin 


Gly 


Tyr 


His 
190 


Pro 


Met 


Pro 


Pro 


Gin 
195 


Pro 


Gly 


Met 


Pro 


Ala 
200 


Ala 


Pro 


Tyr 


Pro 


Thr 
205 


Gin 


Tyr 


Pro 


Pro 


Pro 


Tyr Leu 


Ala 


Gin 


Pro 


Thr 


Gly 


Pro 


Pro 


Ala 


Tyr 


His 


Glu 


Thr 




210 










215 










220 










Leu 


Ala 


Gly Ala 


Ser 


Gin 


Pro 


Pro 


Tyr 


Asn 


Pro 


Ala 


Tyr 


Met 


Asp 


Pro 


225 










230 










235 










240 


Pro 


Lys 


Ala 


Val 


Pro 

























245 



<210> 153 

<211> 1801 

<212> DNA 

<213> Homo sapiens 

<400> 153 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 
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cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120 

tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 

gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240 

tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 

ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 3 60 

gctgggttca gcgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420 

agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 480 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 54 0 

caacaccaca actactaccg cggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 

cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 

agcagcaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 

agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 7 80 

tcccccaaag gcagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840 

catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 

ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 114 0 

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200 

cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260 

tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 1320 

acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380 

actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440 

tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 

tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 

acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620 

gggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 16 80 

tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 

ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

a ~ 1801 

<210> 154 
<211> 245 
<212> PRT 
<213> Homo sapiens 

<400> 154 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 

15 10 15 

Ala Pro Ser Pro Trp Thr Leu Ser Leu Leu Leu Leu Leu Leu Leu Pro 

20 25 30 

Ser Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe Gly Glu Asp Asn 

35 40 45 

Ser lie Pro Glu Ser Cys Pro Asp Phe Cys Cys Gly Ser Cys Ser Ser 

50 55 60 

Gin Tyr Cys Cys Ser Asp Val Leu Lys Lys lie Gin Trp Asn Glu Glu 
65 70 75 80 

Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Ala His Pro Glu Thr 

85 90 95 

Pro Glu Gin Leu Gly Ser Ala Leu Lys Tyr Gin Ser Ser Leu Asp Ser 

100 105 110 

Asp Asn Met Pro Gly Phe Gly Ala Thr Val Ala lie Gly Leu Thr Val 

115 120 125 

Phe Val Val Phe He Ala Thr He He Val Cys Phe Thr Cys Ser Cys 

130 135 140 

Cys Cys Leu Tyr Lys Met Cys Cys Arg Pro Arg Pro Val Val Ser Asn 
145 150 155 160 

Thr Thr Thr Thr Thr Val Val His Thr Ala Tyr Pro Gin Pro Gin Pro 
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165 170 175 

Val Ala Pro Ser Tyr Pro Gly Pro Thr Tyr Gin Gly Tyr His Pro Met 

180 185 190 

Pro Pro Gin Pro Gly Met Pro Ala Val Pro Tyr Pro Thr Gin Tyr Pro 

195 200 205 

Pro Pro Tyr Leu Ala Gin Pro Thr Gly Pro Pro Ala Tyr His Glu Thr 

210 215 220 

Leu Ala Gly Ala Ser Gin Pro Pro Tyr Asn Pro Ala Tyr Met Asp Pro 
225 230 235 240 

Pro Lys Ala Val Pro 
245 

<210> 155 
<211> 1801 
<212> DNA 

<213> Homo sapiens 
<400> 155 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 
cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120 
tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 
gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 240 
tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 
ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360 
gctgggttca gcgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420 
agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 4 80 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 54 0 

caacaccaca actactaccg tggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 
cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 
agcagtaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 
agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 
tcccccaaag gcagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840 
catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 
ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 
gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 
aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 
acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 1140 
ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200 
cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260 
tctccatcag ctgtctgtct ggacggtccc actgtctttc ctgggactat gtagagggcc 1320 
acatgtattc actattcagg ctccagtggc ttccaggcca ggggcctctg tctactacac 1380 
actctggttt ctccctacag tgtcttttta cgattagcca aacatattgc ctgttttttg 1440 
tatccagatg tgtgataatt ggtgaggttg aaatccttgg ttcctggaga acaggaaacc 1500 
tgacctctga cagtccgttt cccttgacac cagcttcata gaatacctga ctcctgtact 1560 
acagtccagt ttgttccagt agcagggaca ccagggccag gggttatctg gaccaagggt 1620 
gggggtggag agcctggatg gtagctctgg accagatgtg aatgcctcca tattccctgt 1680 
tggttcctgt ttcactggct gttttagttt tgtgttaatt ggtgtttctg agcattcaaa 1740 
ctccgcaccc tcgtttataa taaatgaata tttggaaaaa aaaaaaaaaa aaaaaaaaaa 1800 
a 1801 

<210> 156 
<211> 245 
<212> PRT 

<213> Homo sapiens 
<400> 156 

Met Arg Leu Phe Val Arg Pro Ser Val Arg Pro Ala Met Ala Ala Pro 
15 10 15 
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Ala 


Pro 


Ser 


Pro 


Trp 


Thr 


Leu 


Ser 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Pro 








20 










25 










30 






Ser 


Pro 


Gly Ala 


His 


Gly 


Glu 


Leu 


Cys 


Arg 


Pro 


Phe 


Gly 


Glu 


Asp 


Asn 






35 










40 










45 








Ser 


lie 


Pro 


Glu 


Ser 


Cys 


Pro 


Asp 


Phe 


Cys 


Cys 


Gly 


Ser 


Cys 


Ser 


Ser 




50 










55 










60 










Gin 


Tyr 


Cys 


Cys 


Ser 


Asp 


Val 


Leu 


Lys 


Lys 


He 


Gin 


Trp 


Asn 


Glu 


Glu 


65 










70 










75 










80 


Met 


Cys 


Pro 


Glu 


Pro 


Glu 


Ser 


Ser 


Arg 


Phe 


Ser 


Ala 


His 


Pro 


Glu 


Thr 










85 










90 










95 




Pro 


Glu 


Gin 


Leu 


Gly 


Ser 


Ala 


Leu 


Lys 


Tyr 


Gin 


Ser 


Ser 


Leu 


Asp 


Ser 








100 










105 










110 






Asp 


Asn 


Met 


Pro 


Gly 


Phe 


Gly 


Ala 


Thr 


Val 


Ala 


He 


Gly 


Leu 


Thr 


Val 






115 










120 










125 








Phe 


Val 


Val 


Phe 


He 


Ala 


Thr 


He 


He 


Val 


Cys 


Phe 


Thr 


Cys 


Ser 


Cys 




130 










135 










140 










Cys 


Cys 


Leu Tyr 


Lys 


Met 


Cys 


Cys 


Arg 


Pro 


Arg 


Pro 


Val 


Val 


Ser 


Asn 


145 










150 










155 










160 


Thr 


Thr 


Thr 


Thr 


Thr 


Val 


Val 


His 


Thr 


Ala 


Tyr 


Pro 


Gin 


Pro 


Gin 


Pro 










165 










170 










175 




Val 


Ala 


Pro 


Ser 


Tyr 


Pro 


Gly 


Pro 


Thr 


Tyr 


Gin 


Gly 


Tyr 


His 


Pro 


Met 








180 










185 










190 






Pro 


Pro 


Gin 


Pro 


Gly 


Met 


Pro 


Ala 


Ala 


Pro 


Tyr 


Pro 


Thr 


Gin 


Tyr 


Pro 






195 










200 










205 








Pro 


Pro 


Tyr Leu 


Ala 


Gin 


Pro 


Thr 


Gly 


Pro 


Pro 


Ala 


Tyr 


His 


Glu 


Thr 




210 










215 










220 










Leu 


Ala 


Gly Ala 


Ser 


Gin 


Pro 


Pro 


Tyr 


Asn 


Pro 


Ala 


Tyr 


Met 


Asp 


Pro 


225 










230 










235 










240 


Pro 


Lys 


Val 


Val 


Pro 

























245 



<210> 157 

<211> 1801 

<212> DNA 

<213> Homo sapiens 

<400> 157 

gtcgacccac gcgtccggcg gaggttgtgg ctgcaccgtg gtcctgggct tggtcctggg 60 

cttgatgcgt ctgtttgtcc gtccgtccgt ccgtcccgcc atggctgcgc cggcgccctc 120 

tccgtggacc ctttcgctgc tgctgttgtt gctactgccg tctccgggtg cccatggcga 180 

gctgtgcagg cccttcggtg aagacaattc gatcccagag tcctgtcctg acttctgttg 24 0 

tggctcctgt tccagccaat actgctgctc tgacgtgctg aagaaaatcc agtggaatga 300 

ggaaatgtgc cctgagccag agtccagcag attttccgcc cacccggaga caccagaaca 360 

gctgggttca gcgctgaagt atcagtccag tcttgacagt gacaacatgc cagggttcgg 420 

agcgaccgtg gccatcggcc tgaccgtctt cgtggtgttt atcgctacca tcattgtgtg 480 

ctttacctgc tcctgctgct gtctatataa gatgtgctgc cgcccacgac ctgtcgtgtc 540 

caacaccaca actactaccg tggttcacac cgcttaccct cagcctcaac ctgtggcccc 600 

cagctatcct ggaccaacat accagggcta ccatcccatg cccccccagc caggaatgcc 660 

agcagcaccc tacccaacgc agtaccctcc accctacctg gcccagccca cagggccacc 720 

agcctatcat gagacgttgg ctggagccag ccagcctcca tacaacccgg cctacatgga 780 

tcccccaaag gtagttccct gagcctgccc ccagcctctt tggctaacat ttgattatgt 840 

catgtgtgtg tgagtgctat gcagagttct ttactgctgt ctgtggtgcg tgtgccttgt 900 

ctagacatgt ggcttcctct gctgatgacc aggtaggcac aaatcttacc agtgctggtt 960 

gggaccaatc tgttttcttc ctcacttgaa attgtaattt ctgaaatttc aagtaaatta 1020 

aaaacaatag ggtaggaggt atttcccgct tcaccccaag gtgaccagcc atagcctgcc 1080 

acacatagga gagcaagctt tttgtgggtc catgtcctgc tttggggagt agccagctag 1140 

ctgctgctat gggtttattc ccagggcttg gctgcattta gctggacaga gaacaagggg 1200 

cctcagtggc agtgggtcag tgactgatgt cagagcacac taggcagaga gccccgtccg 1260 
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tctccatcag 
acatgtattc 
actctggttt 
tatccagatg 
tgacctctga 
acagtccagt 
gggggtggag 
tggttcctgt 
ctccgcaccc 
a 



ctgtctgtct 
actattcagg 
ctccctacag 
tgtgataatt 
cagtccgttt 
ttgttccagt 
agcctggatg 
ttcactggct 
tcgtttataa 



ggacggtccc actgtctttc ctgggactat 
ctccagtggc ttccaggcca ggggcctctg 
tgtcttttta cgattagcca aacatattgc 
ggtg^gttg aaatccttgg ttcctggaga 
cccttgacac cagcttcata gaatacctga 
agcagggaca ccagggccag gggttatctg 
gtagctctgg accagatgtg aatgcctcca 
gttttagttt tgtgttaatt ggtgtttctg 
taaatgaata tttggaaaaa aaaaaaaaaa 



gtagagggcc 
tctactacac 
ctgttttttg 
acaggaaacc 
ctcctgtact 
gaccaagggt 
tattccctgt 
agcattcaaa 
aaaaaaaaaa 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1801 



<210> 158 

<211> 213 

<212> PRT 

<213> Mus musculus 



<400> 158 



Met 


Ala 


Ala 


Pro 


Ala 


Pro 


Ser 


Leu 


Trp 


Thr 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


1 








5 










10 










15 




Leu 


Leu 


Pro 


Pro 


Pro 


Pro 


Gly 


Ala 


His 


Gly 


Glu 


Leu 


Cys 


Arg 


Pro 


Phe 








20 










25 










30 






Gly 


Glu 


Asp 


Asn 


Ser 


He 


Pro 


Val 


Phe 


Cys 


Pro 


Asp 


Phe 


Cys 


Cys 


Gly 






35 










40 










45 








Ser 


Cys 


Ser 


Asn 


Gin 


Tyr 


Cys 


Cys 


Ser 


Asp 


Val 


Leu 


Arg 


Lys 


He 


Gin 




50 










55 










60 










Trp 


Asn 


Glu 


Glu 


Met 


Cys 


Pro 


Glu 


Pro 


Glu 


Ser 


Ser 


Arg 


Phe 


Ser 


Thr 


65 










70 










75 










80 


Pro 


Ala 


Glu 


Glu 


Thr. 


Pro 


Glu 


His 


Leu 


Gly 


Ser 


Ala 


Leu 


Lys 


Phe 


Arg 










85 










90 










95 




Ser 


Ser 


Phe 


Asp 


Ser 


Asp 


Pro 


Met 


Ser 


Gly 


Phe 


Gly 


Ala 


Thr 


Val 


Ala 








100 










105 










110 






He 


Gly 


Val 


Thr 


Met 


Phe 


Val 


Val 


Phe 


He 


Ala 


Thr 


He 


He 


He 


Cys 






115 










120 










125 








Phe 


Thr 


Cys 


Ser 


Cys 


Cys 


Cys 


Leu 


Tyr 


Lys 


Met 


Cys 


Cys 


Pro 


Gin 


Arg 




130 










135 










14 0 










Pro 


Val 


Val 


Thr 


Asn 


Thr 


Thr 


Thr 


Thr 


Thr 


Val 


Val 


His 


Ala 


Pro 


Tyr 


145 










150 










155 










160 


Pro 


Gin 


Pro 


Gin 


Pro 


Gin 


Pro 


Val 


Ala 


Pro 


Ser 


Tyr 


Pro 


Gly 


Pro 


Thr 










165 










170 










175 




Tyr 


Gin 


Gly 


Tyr 


His 


Pro 


Met 


Pro 


Pro 


Pro 


Ala 


Arg 


Asn 


Ala 


Ser 


Ser 








180 










185 










190 






Thr 


Leu 


Pro 


Asn 


Ala 


Val 


Pro 


Thr 


Thr 


Leu 


Pro 


Gly 


Pro 


Ala 


His 


Arg 






195 










200 










205 








Ala 


Ala 


Thr 


Leu 


Pro 

























210 



<210> 159 

<211> 1858 

<212> DNA 

<213> Mus musculus 

<400> 159 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 12 0 

tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 24 0 

ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300 

ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 
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cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420 

tcgccattgg cgtgaccatg tttgtggtgt ttattgccac tatcatcatc tgcttcacct 4 80 

gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgtggttcat gccccttacc ctcagcctca acctcaacct gtggccccca 600 

gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 

gcagcaccct acccaacgca gtacccacca ccctacctgg cccagcccac agggccgcca 720 

ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960 

caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 114 0 

gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200 

caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260 

tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 132 0 

tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 1380 

cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440 

ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620 

gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740 

tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858 

<210> 160 
<211> 213 
<212> PRT 

<213> Mus musculus 
<400> 160 

Met Ala Ala Pro Ala Pro Ser Leu Trp Thr Leu Leu Leu Leu Leu Leu 

15 10 15 

Leu Leu Pro Pro Pro Pro Gly Ala His Gly Glu Leu Cys Arg Pro Phe 

20 25 30 

Gly Glu Asp Asn Ser lie Pro Val Phe Cys Pro Asp Phe Cys Cys Gly 

35 40 45 

Ser Cys Ser Asn Gin Tyr Cys Cys Ser Asp Val Leu Arg Lys lie Gin 

50 55 60 

Trp Asn Glu Glu Met Cys Pro Glu Pro Glu Ser Ser Arg Phe Ser Thr 
65 70 75 80 

Pro Ala Glu Glu Thr Pro Glu His Leu Gly Ser Ala Leu Lys Phe Arg 

85 90 95 

Ser Ser Phe Asp Ser Asp Pro Met Ser Gly Phe Gly Ala Thr Val Ala 

100 105 110 

He Gly Val Thr He Phe Val Val Phe He Val Thr lie He He Cys 

115 120 125 

Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gin Arg 

130 135 140 

Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Val His Ala Pro Tyr 
145 150 155 160 

Pro Gin Pro Gin Pro Gin Pro Val Ala Pro Ser Tyr Pro Gly Pro Thr 

165 170 175 

Tyr Gin Gly Tyr His Pro Met Pro Pro Pro Ala Arg Asn Ala Ser Ser 

180 185 190 

Thr Leu Pro Asn Ala Val Pro Thr Thr Leu Pro Gly Pro Ala His Arg 
195 200 205 
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Ala Ala Thr Leu Pro 
210 

<210> 161 
<211> 1858 
<212> DNA 

<213> Mus musculus 
<400> 161 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 
tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120 
tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc ,Rn 
cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 
ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 
ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 
cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 
tcgccattgg cgtgaccatc tttgtggtgt ttattgtcac tatcatcatc tgcttcacct 
gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 
caactactac cgtggttcat gccccttacc ctcagcctca acctcaacct gtggccccca 
gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 
gcagcaccct acccaacgca gtacccacca ccctacctgg cccagcccac agggccgcca 
ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 
tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 
gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 
catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 
caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 
aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 
taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 
gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 
caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 
tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 
tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 
cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 
ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 
ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 
gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 
gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 
ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 
tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 
tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 

<210> 162 
<211> 213 
<212> PRT 

<213> Mus musculus 
<400> 162 



35 

Ser Cys Sei 
50 

Trp Asn Gli 
65 



Pro 


Ala 


Pro 


Ser 


Leu 


Trp Thr 


Leu 


Leu 


Leu 


Leu 


Leu Leu 


5 








10 










15 


Pro 


Pro 


Pro Gly Ala 


His Gly 


Glu 


Leu 


Cys 


Arg 


Pro Phe 


20 










25 








30 




Asn 


Ser 


lie 


Pro 


val 


Phe Cys 


Pro Asp 


Phe Cys Cys Gly 










40 








45 




lie Gin 


Asn 


Gin 


Tyr 


Cys 


Cys 


Ser Asp 


Val 


Leu 


Arg 


Lys 








55 








60 






Ser Thr 
80 


Glu 


Met 


Cys 

70 


Pro 


Glu 


Pro Glu 


Ser 
75 


Ser 


Arg 


Phe 


Glu 


Thr 


Pro 


Glu 


His 


Leu Gly 


Ser 


Ala 


Leu 


Lys 


Phe Arg 



60 



180 
240 
300 
360 
420 
480 
540 
600 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1858 
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85 



90 95 



Ser Ser Phe Asp Ser Asp Pro Met Ser Gly Phe Gly Ala Thr Val Ala 

100 105 110 

lie Gly Val Thr lie Phe Val Val Phe lie Ala Thr lie lie lie Cys 

115 120 125 

Phe Thr Cys Ser Cys Cys Cys Leu Tyr Lys Met Cys Cys Pro Gin Arg 

130 135 140 

Pro Val Val Thr Asn Thr Thr Thr Thr Thr Val Ala His Ala Pro Tyr 



Pro Gin Pro Gin Pro oln Pro Val Ala Pro Ser Tyr Pro Gly Pro Thr 
165 



150 



170 175 



Tyr Gin Gly Tyr His Pro Met Pro Pro Pro Ala Arg Asn Ala Ser Ser 

180 I 85 190 

Thr Leu Pro Asn Ala Val Pro Thr Thr Leu Pro Gly Pro Ala His Arg 

195 200 205 

Ala Ala Thr Leu Pro 
210 

<210> 163 

<211> 1858 

<212> DNA 

<213> Mus musculus 

'gjcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc "ggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120 

tattStgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctSSK agaLa?tcg atcccagtgt tctgtcctga tttctgttgt 9£tcctgtt 40 

ccaaccaata clgctgctcg gacgtgctga ggaaaatcca gtggaatgag 9aaatgtgtc 300 

ctoaqccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 

cagcgctgaa atttcgatL agttttgaca gtgaccctat gtcagggttc ^agcgaccg 420 

tcgccattgg cgtgaccatc tttgtggtgt ttattgccac tatcatcatc ^gcttcacct 480 

qc?cctgc?g ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgJggctcat gccccttacc ctcagcctca acctcaacct gtggccccca 600 

gcS^gg a^caacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 60 

Icagcaccc? acccaacgca gtacccacca ccctacctgg cccagcccac agggccgcca 720 

cccLccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 780 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgS gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

ca?glggc?t Ltct^ctgt tgaccaggta ggcgcaagtc ttaccagtgt 999tcgggac 960 

caacc?gttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

"taaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagcl agctttttgc gggtacagag caggctttgg ggggtaacca 9Ctagctgct 1140 

gc?aggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200 

faccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260 

tScatSgct gtJSctgg atgtcactgt ccttcccggg gctgtataga 999«acatg 1320 

tgttcacLt "tcaggctcca ctgggggaat tttcctacct "gctggctt 99ctcctgct 13 80 

cccaqqccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt wo 

S2S ccaaaca^t tgcctgtttt ctgtctccag atgtgtgata attggtgtga 500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt ««9tagca 1620 

gogacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

cScggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca <*ggctgttt 1740 

taqc??tgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

SSca?tg gaLaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858 

<210> 164 
<211> 213 
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<212> PRT 

<213> Mus musculus 



<400> 164 



Met 


Ala 


Ala 


Pro 


Ala 


Pro 


Ser 


Leu 


Trp 


Thr 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


1 








5 










10 










15 




Leu 


Leu 


Pro 


Pro 
20 


Pro 


Pro 


Gly 


Ala 


His 
25 


Gly 


Glu 


Leu 


Cys 


Arg 
30 


Pro 


Phe 


Gly 


Glu 


Asp 
35 


Asn 


Ser 


He 


Pro 


Val 
40 


Phe 


Cys 


Pro 


Asp 


Phe 
45 


Cys 


Cys 


Gly 


Ser 


Cys 
50 


Ser 


Asn 


Gin 


Tyr 


Cys 
55 


Cys 


Ser 


Asp 


Val 


Leu 
60 


Arg 


Lys 


He 


Gin 


Trp 


Asn 


Glu 


Glu 


Met 


Cys 


Pro 


Glu 


Pro 


Glu 


Ser 


Ser 


Arg 


Phe 


Ser 


Thr 


65 










70 










75 










80 


Pro 


Ala 


Glu 


Glu 


Thr 
85 


Pro 


Glu 


His 


Leu 


Gly 
90 


Ser 


Ala 


Leu 


Lys 


Phe 
95 


Arg 


Ser 


Ser 


Phe 


Asp 
100 


Ser 


Asp 


Pro 


Met 


Ser 
105 


Gly 


Phe 


Gly 


Ala 


Thr 
110 


Val 


Ala 


He 


Gly Val 


Thr 


He 


Phe 


Val 


Val 


Phe 


He 


Ala 


Thr 


He 


He 


He 


Cys 






115 










120 










125 








Phe 


Thr 
130 


Cys 


Ser 


Cys 


Cys 


Cys 
135 


Leu 


Tyr 


Lys 


Met 


Cys 
140 


Cys 


Pro 


Gin 


Arg 


Pro 


Val 


Val 


Thr 


Asn 


Thr 


Thr 


Thr 


Thr 


Thr 


Val 


Val 


His 


Ala 


Pro 


Tyr 


145 










150 










155 










160 


Pro 


Gin 


Pro 


Gin 


Pro 
165 


Gin 


Pro 


Val 


Ala 


Pro 
170 


Ser 


Tyr 


Pro 


Gly 


Pro 
175 


Thr 


Tyr 


Gin Gly 


Tyr 


His 


Pro 


Met 


Pro 


Pro 


Pro 


Ala 


Arg 


Asn 


Ala 


Ser 


Ser 








180 










185 










190 






Thr 


Leu 


Pro 
195 


Asn 


Val 


Val 


Pro 


Thr 
200 


Thr 


Leu 


Pro 


Gly 


Pro 
205 


Ala 


His 


Arg 


Ala 


Ala 


Thr 


Leu 


Pro 

























210 



<210> 165 
c211> 1858 
<212> DNA 

<213> Mus musculus 
<400> 165 

gtcgacccac gcgtccgcgc ggaggttgcg gcggcaccgt ggtcttgggc ttggtccgtc 60 

tgttcgtccg tccgttggtc tgtcccgcca tggctgcgcc ggcgccctct ctgtggaccc 120 

tattgctgct gctgttgctg ctgccgccgc ctccgggtgc ccatggtgag ctgtgcaggc 180 

cctttggtga agacaattcg atcccagtgt tctgtcctga tttctgttgt ggttcctgtt 240 

ccaaccaata ctgctgctcg gacgtgctga ggaaaatcca gtggaatgag gaaatgtgtc 300 

ctgagccaga gtccagcaga ttttccaccc ccgcggagga gacacccgaa catctgggtt 360 

cagcgctgaa atttcgatcc agttttgaca gtgaccctat gtcagggttc ggagcgaccg 420 

tcgccattgg cgtgaccatc tttgtggtgt ttattgccac tatcatcatc tgcttcacct 4 80 

gctcctgctg ctgtctgtat aagatgtgct gcccccaacg ccctgtcgtg accaacacca 540 

caactactac cgtggttcat gccccttacc ctcagcctca acctcaacct gtggccccca 600 

gctatcctgg accaacatac cagggctacc atcccatgcc ccccccagcc aggaatgcca 660 

gcagcaccct acccaacgta gtacccacca ccctacctgg cccagcccac agggccgcca 72 0 

ccctaccatg agtccttggc tggagccagc cagcctccat acaacccgac ctacatggat 78 0 

tccctaaaga caattccctg aacctgcccc cagcctcttt ggctgccatt tatgtcgtgt 840 

gtgagtgagt gatacgcaga gttctttact gctgtctgtg gtgtgtgtgc cttgtctaga 900 

catgtggctt cctctgctgt tgaccaggta ggcgcaagtc ttaccagtgt gggtcgggac 960 

caacctgttt tcttcctcac ttgaaattgt actttctgaa atttcaagca aattaaaaac 1020 

aataaggtag gaggtatttc ccacgtcacc ccaaggtgac cagccatggc ctgtcatact 1080 

taggagagca agctttttgc gggtacagag caggctttgg ggggtaacca gctagctgct 1140 
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gctaggcctt tattcccagg gtttggctgc attggcagtg aggcaggtgg ctgggggtga 1200 

caccaggtga caaggggact cagtggcagg gggtcacacc aggcagaaca ccatacactc 1260 

tccatcagct gtctgtctgg atgtcactgt ccttcccggg gctgtataga gggccacatg 1320 

tgttcactat tcaggctcca ctgggggaat tttcctacct ttgctggctt ggctcctgct 1380 

cccaggccag ggacctcggt ctgtctacta cacactctgg tttctccctg cactgtcttt 1440 

ttactgttag ccaaacattt tgcctgtttt ctgtctccag atgtgtgata attggtgtga 1500 

ggttgaaatc cctggttcct ggaggacaga caacctgacc tccgactgtc agtttccctt 1560 

gacaccatct tcatagaaat acctgactcc tgtaccacag tccagtttgt cccagtagca 1620 

gggacaccaa ggccaatggg ttatctggac caaaggtggg gtggagggcc tagatggtat 1680 

ctccggccca gatgtgaata cctccatatt ccctgttggt tcctgtttca ctggctgttt 1740 

tagctttgtg ttgattggtg tttctgagca ttcagactcc gcaccctcat ttctaataaa 1800 

tgcaacattg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1858 

<210> 166 
<211> 639 
<212> DNA 
<213> Mus musculus 



<400> 166 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 12 0 

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 

aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 3 00 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat gtttgtggtg 360 

tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 4 80 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 639 

<210> 167 
<211> 639 
<212> DNA 
<213> Mus musculus 



<400> 167 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120 

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 

aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 300 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360 

tttattgtca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 480 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 54 0 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 639 

<210> 168 
<211> 639 
<212> DNA 

<213> Mus musculus 



<400> 168 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120 

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 
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aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 300 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 360 

tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggctca tgccccttac 4 80 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 540 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgc agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 63 9 



<210> 169 

<211> 639 

<212> DNA 

<213> Mus musculus 



<400> 169 

atggctgcgc cggcgccctc tctgtggacc ctattgctgc tgctgttgct gctgccgccg 60 

cctccgggtg cccatggtga gctgtgcagg ccctttggtg aagacaattc gatcccagtg 120 

ttctgtcctg atttctgttg tggttcctgt tccaaccaat actgctgctc ggacgtgctg 180 

aggaaaatcc agtggaatga ggaaatgtgt cctgagccag agtccagcag attttccacc 240 

cccgcggagg agacacccga acatctgggt tcagcgctga aatttcgatc cagttttgac 3 00 

agtgacccta tgtcagggtt cggagcgacc gtcgccattg gcgtgaccat ctttgtggtg 3 60 

tttattgcca ctatcatcat ctgcttcacc tgctcctgct gctgtctgta taagatgtgc 420 

tgcccccaac gccctgtcgt gaccaacacc acaactacta ccgtggttca tgccccttac 4 80 

cctcagcctc aacctcaacc tgtggccccc agctatcctg gaccaacata ccagggctac 54 0 

catcccatgc cccccccagc caggaatgcc agcagcaccc tacccaacgt agtacccacc 600 

accctacctg gcccagccca cagggccgcc accctacca 639 

<210> 170 
<211> 1218 
<212> DNA 

<213> Homo sapiens 



<400> 170 

atggggccca gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 12 0 

ctggcccagt gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 180 

aagatgctgc cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 24 0 

gacaccatct ccgggagagt ggatcgtctg gagcgggagg cagactatct ggagacccag 3 00 

aacccagctc tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 3 60 

ggcaagggaa gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 4 20 

caagtgagat caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 4 80 

gatccactgg ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagcc 540 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

gtccgggtgc ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 660 

tttgctcgga ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 720 

ctaatcaaat tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 7 80 

gggctgatcc ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 840 

gaaggtcttt gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 900 

ttagatccac agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 960 

gctgaggctg cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 1020 

agtcgggccc gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 1080 

gcactccctt attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 1140 

gaacgccagc tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 12 00 

aagaaagagg aggaggtt 1218 



<210> 171 
<211> 1218 
<212> DNA 
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<213> Homo sapiens 



<400> 171 



atggggccca 
ggacagcagc 
ctggcccagt 
aagatgctgc 
gacaccatct 
aacccagctc 
ggcaagggaa 
caagtgagat 
gatccactgg 
tttgtcttcc 
gtccgggtgc 
tttgctcgga 
ctaatcaaat 
gggctgatcc 
gaaggtcttt 
ttagatccac 
gctgaggctg 
agtcgggccc 
gcactccctt 
gaacgccagc 
aagaaagagg 



gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 
accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 
gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 
cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 
ccgggagagt ggatcgtctg gagcgggagg tagactatct ggagacccag 
tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 
gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 
caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 
ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagcc 
caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 
ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 
ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 
tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 
ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 
gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 
agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 
cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 
gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 
attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 
tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 
aggaggtt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1218 



<210> 172 

<211> 1219 

<212> DNA 

<213> Homo sapiens 

<400> 172 

catggggccc agcacccctc tcctcatctt gttccttttg tcatggtcgg gacccctcca 60 

aggacagcag caccaccttg tggagtacat ggaacgccga ctagctgctt tagaggaacg 12 0 

gctggcccag tgccaggacc agagtagtcg gcatgctgct gagctgcggg acttcaagaa 180 

caagatgctg ccactgctgg aggtggcaga gaaggagcgg gaggcactca gaactgaggc 24 0 

cgacaccatc tccgggagag tggatcgtct ggagcgggag gtagactatc tggagaccca 300 

gaacccagct ctgccctgtg tagagtttga tgagaaggtg actggaggcc ctgggaccaa 360 

aggcaaggga agaaggaatg agaagtacga tatggtgaca gactgtggct acacaatctc 420 

tcaagtgaga tcaatgaaga ttctgaagcg atttggtggc ccagctggta tatggaccaa 4 80 

ggatccactg gggcaaacag agaagatcta cgtgttagat gggacacaga atgacacagc 54 0 

ctttgtcttc ccaaggctgc gtgacttcac ccttgccatg gctgcccgga aagcttcccg 600 

agtccgggtg cccttcccct gggtaggcac agggcagctg gtatatggtg gctttcttta 660 

ttttgctcgg aggcctcctg gaagacctgg tggaggtggt gagatggaga acactttgca 720 

gctaatcaaa ttccacctgg caaaccgaac agtggtggac agctcagtat tcccagcaga 780 

ggggctgatc cccccctacg gcttgacagc agacacctac atcgacctgg cagctgatga 84 0 

ggaaggtctt tgggctgtct atgccacccg ggaggatgac aggcacttgt gtctggccaa 900 

gttagatcca cagacactgg acacagagca gcagtgggac acaccatgtc ccagagagaa 960 

tgctgaggct gcctttgtca tctgtgggac cctctatgtc gtctataaca cccgtcctgc 1020 

cagtcgggcc cgcatccagt gctcctttga tgccagcggc accctgaccc ctgaacgggc 108 0 

agcactccct tattttcccc gcagatatgg tgcccatgcc agcctccgct ataacccccg 1140 

agaacgccag ctctatgcct gggatgatgg ctaccagatt gtctataagc tggagatgag 1200 

gaagaaagag gaggaggtt 1219 

<210> 173 

<211> 1218 

<212> DNA 

<213> Homo sapiens 
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<400> 173 

atggggccca gcacccctct cctcatcttg ttccttttgt catggtcggg acccctccaa 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcttt agaggaacgg 120 

ctggcccagt gccaggacca gagtagtcgg catgctgctg agctgcggga cttcaagaac 180 

aagatgctgc cactgctgga ggtggcagag aaggagcggg aggcactcag aactgaggcc 240 

gacaccatct ccgggagagt ggatcgtctg gagcgggagg tagactatct ggagacccag 300 

aacccagctc tgccctgtgt agagtttgat gagaaggtga ctggaggccc tgggaccaaa 360 

ggcaagggaa gaaggaatga gaagtacgat atggtgacag actgtggcta cacaatctct 42 0 

caagtgagat caatgaagat tctgaagcga tttggtggcc cagctggtct atggaccaag 48 0 

gatccactgg ggcaaacaga gaagatctac gtgttagatg ggacacagaa tgacacagtc 54 0 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

gtccgggtgc ccttcccctg ggtaggcaca gggcagctgg tatatggtgg ctttctttat 660 

tttgctcgga ggcctcctgg aagacctggt ggaggtggtg agatggagaa cactttgcag 720 

ctaatcaaat tccacctggc aaaccgaaca gtggtggaca gctcagtatt cccagcagag 780 

gggctgatcc ccccctacgg cttgacagca gacacctaca tcgacctggc agctgatgag 840 

gaaggtcttt gggctgtcta tgccacccgg gaggatgaca ggcacttgtg tctggccaag 900 

ttagatccac agacactgga cacagagcag cagtgggaca caccatgtcc cagagagaat 960 

gctgaggctg cctttgtcat ctgtgggacc ctctatgtcg tctataacac ccgtcctgcc 102 0 

agtcgggccc gcatccagtg ctcctttgat gccagcggca ccctgacccc tgaacgggca 1080 

gcactccctt attttccccg cagatatggt gcccatgcca gcctccgcta taacccccga 1140 

gaacgccagc tctatgcctg ggatgatggc taccagattg tctataagct ggagatgagg 1200 

aagaaagagg aggaggtt 1218 

<210> 174 
<211> 729 
<212> DNA 

<213> Mus mus cuius 



<400> 174 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 12 0 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 24 0 

gggccgcgtg gagaggtagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 42 0 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 4 80 

cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 54 0 

tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 

gtcttcgct 729 



<210> 175 

<211> 729 

<212> DNA 

<213> Mus musculus 



<400> 175 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240 

gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagtcttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 360 

acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 420 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 480 

cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 54 0 
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tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 

gtcttcgct 729 

<210> 176 
<211> 729 
<212> DNA 
<213> Mus musculus 

<400> 176 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 12 0 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 24 0 

gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 3 00 

cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 3 60 

acacccctac ctttcgaccg tgcgctgcta aatgagcagg gccattacga ccccactact^ 420 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 4 80 

cgggccagct tgcagtttga tcttgtcaaa aacgggcagt ccatcgcctc tttcttccag 54 0 

tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 

gtcttcgct 729 

<210> 177 
<211> 729 
<212> DNA 
<213> Mus musculus 

<400> 177 

atgaggccac ttcttgccct tctgcttctg ggtctggtgt caggctctcc tcctctggac 60 

gacaacaaga tccccagcct gtgtcccggg cagcccggcc ttccaggcac accaggtcac 120 

catggcagcc aaggcctgcc tggccgtgac ggccgtgatg gccgcgacgg tgcacccgga 180 

gctccgggag agaaaggcga gggcgggaga ccgggactac ctggcccacg tggggagccc 240 

gggccgcgtg gagaggcagg gcccatgggg gctatcgggc ctgcggggga gtgctcggta 300 

cccccacgat cagccttcag tgccaagcga tccgagagcc gggtacctcc gccagccgac 3 60 

acacccctac ctttcgaccg tgtgctgcta aatgagcagg gccattacga ccccactact 42 0 

ggcaagttca cctgccaagt gcctggcgtc tactactttg ctgtgcacgc cactgtctac 4 80 

cgggccagct tgcagtttga tattgtcaaa aacgggcagt ccatcgcctc tttcttccag 54 0 

tattttgggg ggtggcccaa gccagcctcg ctctcagggg gtgcgatggt aaggctagaa 600 

cctgaggacc aggtgtgggt gcaggtgggc gtgggtgatt acattggcat ctatgccagc 660 

atcaagacag acagtacctt ctctggattt ctcgtctatt ctgactggca cagctcccca 720 

gtcttcgct 729 

<210> 178 
<211> 1218 
<212> DNA 

<213> Mus musculus 
<400> 178 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 24 0 

gactccatct caggaagagt ggaccgtatt gaaagggagg tagactatct ggagacacag 300 

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 3 60 

ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 420 
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caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 4 80 

gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 54 0 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 600 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 

aagaaggagg aggaagtt 1218 

<210> 179 

<211> 1218 

<212> DNA 

<213> Mus musculus 

<400> 179 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 24 0 

gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 300 

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 3 60 

ggcaagggcc gaagaaatga gaaatacgat atagtgacgg actgtagcta cacagtcgct 420 

caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 4 80 

gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 54 0 

tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 60 0 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 78 0 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 84 0 

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 102 0 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 

aagaaggagg aggaagtt 1218 

<210> 180 

<211> 1218 

<212> DNA 

<213> Mus musculus 

<400> 180 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240 

gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 300 

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360 

ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 4 20 

caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 4 80 

gatccgctgg ggccagcaga gaagatctac gcgttagacg gcacccagaa cgacacggct 540 
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600 



tttgtcttcc caaggctgcg tgacttcacc cttgccatgg ctgcccggaa agcttcccga 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 

gagggcctgt.gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag. cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 114 0 

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 12 00 



aagaaggagg aggaagtt 1218 

<210> 181 
<211> 1218 
<212> DNA 

<213> Mus musculus 
<400> 181 

atggggccca gtgctcctct gctgctcctc ttctttttgt catggacggg accccttcag 60 

ggacagcagc accaccttgt ggagtacatg gaacgccgac tagctgcctt agaggaacgg 120 

ctggcccaat gccaggatca gagtagtcgg catgctgccg agcttcggga cttcaaaaac 180 

aagatgttgc ctctcctgga ggtggcagag aaggagcggg agaccctcag aactgaagca 240 

gactccatct caggaagagt ggaccgtctt gaaagggagg tagactatct ggagacacag 300 

aacccagctt tgccctgtgt agagctggat gagaaggtga ctggaggtcc tggagccaaa 360 

ggcaagggcc gaagaaatga gaaatacgat atggtgacgg actgtagcta cacagtcgct 420 

caggtgaggt caatgaagat cctgaagcgg tttggtggtt cagttggcct atggaccaag 480 

gatccgctgg ggccagcaga gaagatctac gtgttagacg gcacccagaa cgacacggct 54 0 

tttgtcttcc caaggctgcg tgacttcacc cttgtcatgg ctgcccggaa agcttcccga 600 

attcgggtgc ccttcccctg ggtaggcacg gggcagctgg tgtacggtgg cttcctttat 660 

tatgctcgaa ggcctcctgg aggacctgga gggggtggtg aattggagaa cactctgcag 720 

ctgatcaaat ttcacttggc aaaccgaaca gtggtggata gctcagtgtt ccctgcagag 780 

agcctgatac ccccctacgg cctgacagca gatacatata tcgacctggc agctgatgag 840 

gagggcctgt gggctgtcta tgccactcga gatgatgaca ggcatttgtg tctagccaag 900 

ttagacccac agacacttga cacagagcag cagtgggaca caccatgtcc cagagagaac 960 

gcagaggctg cgtttgtcat ctgtgggacc ctgtacgttg tctataacac ccgccctgcc 1020 

agtagggctc gtattcagtg ttccttcgat gccagtggta ctctcgcccc tgaaagggca 1080 

gcactctcct attttccacg ccgatatggt gcccatgcca gccttcgcta taacccccgt 1140 

gagcgccagc tgtatgcctg ggatgatggc taccagattg tctacaaatt ggagatgaag 1200 

aagaaggagg aggaagtt 1218 

<210> 182 
<211> 1110 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 182 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60 

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 12 0 

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggcg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300 

gtctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 
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tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420 

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtctc 4 80 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 54 0 

cccgctgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840 
ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 
tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt 1110 



900 
960 



<210> 183 

<211> 1110 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 



<400> 183 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60 

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120 

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggtg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 3 00 

gcctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420 

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtctc 4 80 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 54 0 

cccgctgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 7 80 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840 
ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 
tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt H 10 

<210> 184 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 



900 
960 



<400> 184 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 
agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 
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caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggtg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 300 

gtctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420 

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtatc 480 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 54 0 

cccgctgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840 

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900 

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt 1110 

<210> 185 
<211> 1110 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> modif ied_base 
<222> all "n" positions 
<223> n=a, c, g, or t 

<400> 185 

atgatttccc tcccggggcc cctggtgacc aacttgntgc ggtttttgtt cctggggctg 60 

agtgccctcg cgcccccctc gcgggcccag ctgcaactgc acttgcccgc caaccggttg 120 

caggcggtgg aggaggggga aagtggtgct tcagcatggt acaccttgca cagggaggtg 180 

tcttcatccc agccatggga ggtgcccttt gtgatgtggt tcttcaaaca gaaagaaaag 240 

gaggatcagg tgttgtccta catcaatggg gtcacaacaa gcaaacctgg agtatccttg 3 00 

gtctactcca tgccctcccg gaacctgtcc ctgcgggtgg agggtctcca ggagaaagac 360 

tctggcccct acagctgctc cgtgaatgtg caagacaaac aaggcaaatc taggggccac 420 

agcatcaaaa ccttagaact caatgtactg gttcctccag ctcctccatc ctgccgtctc 4 80 

cagggtgtgc cccatgtggg ggcaaacgtg accctgagct gccagtctcc aaggagtaag 540 

cccgttgtcc aataccagtg ggatcggcag cttccatcct tccagacttt ctttgcacca 600 

gcattagatg tcatccgtgg gtctttaagc ctcaccaacc tttcgtcttc catggctgga 660 

gtctatgtct gcaaggccca caatgaggtg ggcactgccc aatgtaatgt gacgctggaa 720 

gtgagcacag ggcctggagc tgcagtggtt gctgaagctg ttgtgggtac cctggttgga 780 

ctggggttgc tggctgggct ggtcctcttg taccaccgcc ggggcaaggc cctggaggag 840 

ccagccaatg atatcaagga ggatgccatt gctccccgga ccctgccctg gcccaagagc 900 

tcagacacaa tctccaagaa tgggaccctt tcctctgtca cctccgcacg agccctccgg 960 

ccaccccatg gccctcccag gcctggtgca ttgaccccca cgcccagtct atccagccag 1020 

gccctgccct caccaagaca tgcccacgac agatggggcc caccctcaac caatatcccc 1080 

catccctggt ggggtttttt cctttggctt 1110 

<210> 186 
<211> 1182 
<212> DNA 

<213> Mus musculus 
<400> 186 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 
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gaggagtcgt ggtcccaccc ccgggaggtg cccatcatga tctggttctt ggaacaagaa 24 0 

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 42 0 

ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 72 0 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 84 0 

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 114 0 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 187 

<211> 1182 

<212> DNA 

<213> Mus musculus 



<400> 187 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 

gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 24 0 

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 42 0 

ataggccaca gcatcaaaag catagagctc aaagcgctgg ttcctccagc tcctccatcc 4 80 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 54 0 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 78 0 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 84 0 

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 114 0 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 188 

<211> 1182 

<212> DNA 

<213> Mus musculus 



<400> 188 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 12 0 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 

gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 24 0 

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 300 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 
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ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg tcccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840 

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 1020 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgctc tgagccgcat gggtgctgtg 114 0 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 

<210> 189 

<211> 1182 

<212> DNA 

<213> Mus tnusculus 



<400> 189 

atgattcttc aggctggaac ccccgagacc agcttgctgc gggttttgtt cctgggactg 60 

agtacccttg ctgccttctc ccgagctcag atggagttgc acgtgccccc gggcctcaac 120 

aaattggaag cggtagaggg agaagaagtg gtgctccccg cctggtacac gatggcacgg 180 

gaggagtcgt ggtcccaccc ccgggaggtg cccatcctga tctggttctt ggaacaagaa 240 

gggaaggaac caaaccaggt gttgtcttac attaatggag tcatgacaaa taaacctgga 3 00 

acagccctgg tccactctat ctcttcacgg aatgtgtccc tgcgcctggg ggcactccag 360 

gagggagact ctgggactta ccgctgttct gtcaatgtgc agaatgatga aggcaaaagt 420 

ataggccaca gcatcaaaag catagagctc aaagtgctgg ttcctccagc tcctccatcc 480 

tgtagtttac agggtgtacc ctatgtcggg accaatgtga ccctgaactg caagtcccca 540 

aggagtaaac ctactgctca gtaccagtgg gagaggctgg ccccatcctc ccaggtcttc 600 

tttggaccag ccttagatgc tgttcgtgga tctttaaagc tcactaacct ttccattgcc 660 

atgtctggag tctatgtctg caaggctcaa aacagagtgg gctttgccaa gtgcaacgtg 720 

accttggacg tgatgacagg gtccaaggct gcagtggtcg ctggagcagt tgtgggcact 780 

tttgttgggt tggtgctgat agctgggctg gtcctgttgt accagcgccg gagcaagacc 840 

ttggaagagc tggccaatga tatcaaggaa gatgccattg ctccccggac cttgccttgg 900 

accaaaggct cagacacaat ctccaagaat gggacacttt cttcggtcac ctcagcacga 960 

gctctgcggc cacccaaggc tgctcctcca agacctggca catttactcc cacacccagt 102 0 

gtctctagcc aggccctgtc ctcaccaaga ctgcccaggg tagatgaacc cccacctcag 1080 

gcagtgtccc tgaccccagg tggggtttct tcttctgttc tgagccgcat gggtgctgtg 114 0 

cctgtgatgg tgcctgcaca gagtcaggct gggtctcttg tg 1182 



<210> 190 
<211> 735 
<212> DMA 
<213> Homo sapiens 



<400> 190 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 24 0 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 300 

ggttcagtgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 
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tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720 
ccaaaggcag ttccc 735 

<210> 191 

<211> 735 

<212> DNA 

<213> Homo sapiens 



60 



<400> 191 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 3 00 

ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 360 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgcggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gcaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 

tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720 

ccaaaggcag ttccc 735 

<210> 192 
<211> 735 
<212> DNA 
<213> Homo sapiens 

<400> 192 

atgcgtctgt ttgtccgtcc gtccgtccgt cccgccatgg ctgcgccggc gccctctccg 60 

tggacccttt cgctgctgct gttgttgcta ctgccgtctc cgggtgccca tggcgagctg 120 

tgcaggccct tcggtgaaga caattcgatc ccagagtcct gtcctgactt ctgttgtggc 180 

tcctgttcca gccaatactg ctgctctgac gtgctgaaga aaatccagtg gaatgaggaa 240 

atgtgccctg agccagagtc cagcagattt tccgcccacc cggagacacc agaacagctg 3 00 

ggttcagcgc tgaagtatca gtccagtctt gacagtgaca acatgccagg gttcggagcg 3 60 

accgtggcca tcggcctgac cgtcttcgtg gtgtttatcg ctaccatcat tgtgtgcttt 420 

acctgctcct gctgctgtct atataagatg tgctgccgcc cacgacctgt cgtgtccaac 480 

accacaacta ctaccgtggt tcacaccgct taccctcagc ctcaacctgt ggcccccagc 540 

tatcctggac caacatacca gggctaccat cccatgcccc cccagccagg aatgccagca 600 

gtaccctacc caacgcagta ccctccaccc tacctggccc agcccacagg gccaccagcc 660 

tatcatgaga cgttggctgg agccagccag cctccataca acccggccta catggatccc 720 
ccaaaggcag ttccc 

<210> 193 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 

<400> 193 
caaagtgagc tcatgctctc ac 

<210> 194 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> reverse primer 

<400> 194 
ctctggtctt gggcagaaat c 

<210> 195 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 

<400> 195 
ggagtatcct tggtctactc c 

<210> 196 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 

<400> 196 
gaaagtctgg aaggatggaa get 

<210> 197 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 

<400> 197 
ggatgatggc taccagattg tc 

<210> 198 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 

<400> 198 
ggaacattga gggttttgac tc 
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